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Current upper bounds of the neutron electric dipole moment constrain the physically observable 
quantum chromodynamic (QCD) vacuum angle |0| < 10 !!. Since QCD explains vast experimental 
data from the 100 MeV scale to the TeV scale, it is better to explain this smallness of |0| in the 
QCD framework, which is the strong CP problem. Now, there exist two plausible solutions to this 
problem, one of which leads to the existence of the very light axion. The axion decay constant 
window, 10? GeV < Fu € 10? GeV for a O(1) initial misalignment angle 61, has been obtained 
by astrophysical and cosmological data. For Fa > 10'* GeV with 04 < O(1), axions may constitute 
a significant fraction of dark matter of the universe. The supersymmetrized axion solution of the 
strong CP problem introduces its superpartner the axino which might have affected the universe 
evolution significantly. Here, we review the very light axion (theory, supersymmetrization, and 
models) with the most recent particle, astrophysical and cosmological data, and present prospects 


for its discovery. 


PACS numbers: 14.80.Mz, 12.38. Aw, 95.35.+d, 11.30.-j 


I. OVERVIEW 


Strong interaction phenomena has revealed that the 
discrete symmetries of charge conjugation C, parity 
TP and time reversal 7 are separately good symmetries 
of nature. Therefore, quantum chromodynamics(QCD) 
based on the gauge group SU(3). BRA, Han, Nambu 
(1965), Bardeen, Fritszch, Gell-Mann (1972)] must re- 
spect any combinations of these discrete symmetries C, 
P and T to be accepted as the theory of strong inter- 
actions. Among these discrete symmetries, the CP sym- 
metry is not necessarily respected in QCD due to the 
nonzero QCD vacuum angle 0, an issue known as the 
“strong CP problem”. Since QCD is so successful phe- 
nomenologically, any possible solution to "the strong 
CP problem" is expected to be realized in nature. Cur- 
rently the most attractive solution leads to the existence 
of a very light axion [135,224,831 B79, Kim (1979), Shif- 
man, Vainstein, Zakharov (1980), Dine, Fischler, Sred- 
nicki (1981), Zhitnitskii(1981)]. Searches for QCD ax- 
ions generated from the Sun | Bos, Andriamonje et. 
al. (CERN Axion Search Telescope (CAST) Collabo- 
ration, 2007), Inoue et. al. (Tokyo Axion Helioscope 
Collaboration, 2008)] and remnant axions from the early 
universe bd. [xd Rosenberg (Axion Dark Matter Ex- 
periment(ADMX) Collaboration, 2004), Carosi (ADMX 
Collaboration, 2007)] are presently ongoing. 


The story of axions started with the QCD U(1) prob- 
lem B58, Weinberg (1975)] which is now understood, 
having been solved by the ’t Hooft determinental inter- 
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action [844, B49, 't Hooft (1976,1986)]. The determinen- 
tal interaction is shown pictorially as the left diagram of 
Fig. [and the solution is shown as the shaded right dia- 
gram. The strong interaction makes the quark bilinears 
condense with the vacuum expectation value (VEV) of 
order v ~ 260 MeV. The phase of this interaction 6 origi- 
nates from the QCD vacuum angle which is known to be 
physical 53, Callan, Dashen, Gross (1976), Jackiw, 
Rebbi (1976)] and contributes to the neutron electric 
dipole moment (NEDM) by an order 0 times the neutron 
size, which is absurdly large. Peccei and Quinn(PQ) ob- 
served that there exists a way to make 0 just a phase by 
introducing a symmetry, now called U(1)po, and hence 
physical amplitudes do not depend on 0 as in the massless 
quark case bor, Peccei, Quinn (1977)]. In the stan- 
dard model (SM), this phase is a pseudoscalar Goldstone 
boson called the ‘axion’ in the multitude of Higgs fields 
as noted in [360, B67, Weinberg (1978), Wilczek (1978)]. 
If the PQ idea is completed with Fig. [I] this axion is 
exactly massless (but observable), and 0 would behave 
‘unphysically’ by the freedom of choosing an appropriate 
axion VEV, which was the original PQ idea. However, 
there exists subleading terms, proportional to one power 
of mq, which close the quark lines with the current quark 
mass instead of a condensation. Then, an axion poten- 
tial develops, and the axion becomes a pseudo-Goldstone 
boson. The axion solution of the strong CP problem is 
cosmological in that the axion VEV chooses 0 = 0 at the 
minimum of this axion potential. The currently allowed 
axion is very light and long lived. 


The properties of the axion (denoted as a) are chiefly 
given by its decay constant Fa which sets the scale of non- 
renormalizable axion interactions through a/Fa. Initial 
axion searches placed Fa far above the electroweak scale 
and additional stringent bounds on F, were obtained 
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FIG. 1: The determinental interaction of light quarks. Chiral 
symmetry breaking introduces the anomalous 7’ mass term 
from the quark condensations. 


from the studies of stellar evolutions and commoloay pad 
Kim (1987)]. Axion astrophysics started by [129 (130, Di- 
cus, Kolb, Teplitz, Wagoner (1978, 1980)], using earlier 
ideas from E24 , K. Sato, H. Sato (1975), K. Sato 
(1978)], now gives a stringent lower bound on the de- 
cay constant, F > 0.5 x 10? GeV, from the study of 
SN1987A Bod. , Turner (1990), Raffelt (1990)]. With 
this large decay constant, the axion flux from the Sun is 
a small fraction of the solar neutrino flux, but still may 
be detectable by the CAST experiment and by the Tokyo 
helioscope. 


It is known that very light axions with F, in the 10!” 
GeV region (axion mass in the micro-eV range) can com- 
pose some part of cold dark matter(CDM) in the un- 
verse [33 Preskill, Wise, Wilczek (1983), Ab- 
bott, Sikivie (1983), Dine, Fischler (1983)]. The exact 
amount of the axion CDM depends on the initial ax- 
ion misalignment angle 0, at the time of axion creation 
when the universe temperature was around the axion de- 
cay constant, T ~ Fa. This observation puts the very 
light axion on the list of leading CDM candidate parti- 
cles. If indeed these cosmic axions compose a significant 
fraction of CDM in the universe, they may be detectable 
by collecting axion-converted photons in the cavity type 
detectors Sikivie (1983)] as tried by [12d. sd De- 
Panfilis et al. (RBF Collaboration, 1987), Hagmann et 
al. (Univ. of Florida, 1990)] and now continuing at the 
ADMX experiment. 

Cosmology with CDM was the leading candidate of 
the early universe in the 1980s lid bad, , Blumen- 
thal, Faber, Primack, Rees (1984), Kolb, Turner (1990), 
Weinberg (2008)]. Since then this view has given way to 
the new cosmology with the discovery of dark energy (DE) 
in 1998 , Perlmutter et al. (1998), Riess et. al. 
(1998)]. The current view of the dominant components 
of the universe is Qcpm œ 0.23 and Qa c 0.73 with 
only a few percent consisting of baryons , Spergel 
et al. (Wilkinson Microwave Anisotropy Probe(WMAP) 
Collaboration, 2007)]. The most plausible dark matter 
(DM) candidates at present are the lightest supersym- 
metric(SUSY) particle(LSP), the axion, the axino, and 
the gravitino. Here, we will review mostly on the axion 
and its CDM-related possibility. 


'The need for DM was suggested as early as the 1930s 
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B34 B8il, Zwicky (1933), Smith (1936)]. Since then, evi- 
dences of nonluminous DM in the universe has been accu- 
mulating: examples of which include flat galactic rotation 
curves, Chandra satellite photos, and gravitational lens- 
ing effects. If the galactic bulge is the dominant mass in 
the galaxy, the velocity v of a star located at r from the 
center should rotate with v ~ r-!/?, But the observed 
flat rotation curve See, for example, McGaugh et al. 
(2007), and references therein] violates this expectation 
and implies an extended mass in the halo as p(r) ~ 1/r?. 
Also, the Chandra observation of X-ray and gravitational 
lensing images implies this matter profile around the bul- 
let cluster , Clowe et al. (Chandra Collaboration, 
2006)]. Circular gravitational lensing images Bog, Jee et 
al. (2007)] also support the existence of DM. The DM 
density around us the Solar system is usually taken as 
PDM © (0.3 A 0.45) GeV/cm?. 

Current CDM candidates belong to either incoherent 
particles or coherent oscillations of spin-0 fields. From 
this disctinction, the bosonic collective motion such as 
the axion can be considered as CDM. The popular inco- 
herent CDM particles are the weakly interacting massive 
particles: WIMP) or decay products of WIMPs. A more 
frequently used independent distinction is thermal relics 
and nonthermal relics, but there is no strict correspon- 
dence relation between the incoherent-coherent particles 
and the thermal-nonthermal relics. WIMPs are massive 
particles with weak interaction cross sections, first dis- 
cussed in terms of a heavy neutrino, corresponding to 
the RHS crossing point of Fig. Bia) , Lee, Weinberg 
(1977)]. The LHS crossing point corresponds to 10 eV 
neutrino [ud bod Marx, Szalay (1972), Cowsik, Mc- 
Clelland (1972)]. WIMPs, such as the LSP, are thermal 
relics when their number density is determined by the 
freezeout temperature and are called nonthermal relics if 
their number density is determined by the other mech- 
anism such as by the decay of heavier relics lod Choi, 
Kim, Lee, Seto (2008). In Fig. Db), we sketch the 
axion energy density in terms of the axion mass. The 
shape is a flipped one from that of Fig. Ba), because in 
the axion case the low mass and high mass regions con- 
tributes Q, from different physics, one from the vacuum 
misalignment and the other from the hot thermal relics. 

In addition to the heavy neutrino, SUSY with R-parity 
conservation allows the LSP to be just such a WIMP par- 
ticle. The LSP interaction is “weak” since the interaction 
mediators (SUSY particles) are supposed to be in the 100 
GeV range. For a WIMP to be a successful CDM candi- 
date, usually the interaction cross section at the time of 
decoupling needs to be around pad Bad Kolb, Turner 
(1990), Spergel et al. (WMAP Collaboration, 2007)] 


ES —26 ,3,-1 
(Gin) lat decoupling ~ 0.2 x 10 cm s , 


with Q,,h? ~ 0.113 + 0.009. (1) 


This is roughly the cross-section for the LSP from low 
energy SUSY, which is the reason why the DM commu- 
nity is so interested in the WIMP LSP. Some superweakly 
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FIG. 2: The Lee-Weinberg type plots for (a) the neutrino 
Q, n? pad and (b) the axion Qah?, where h is the present 
Huble constant in units of 100 kms"! Mpc^!. The dash line 
of (a) is for Q,h? = 0.113. In (b), it corresponds to the 
hadronic axion. The blue and red dashlines correspond to 
the CDM and hot DM limits, respectively. 


interacting particles such as gravitinos, axinos, and wim- 
pzillas , Chung, Kolb, Riotto (1999)] can be CDM 
candidates as well, but their cross sections do not fall in 
the range of Eq. (Ij. The CDM candidate particles are 
shown in the o; versus mass plane in Fig. B] with mi- 
nor modification from that of , Roszkowski (2004)]. 
'The incoherent fermions, such as the neutrino and left 
ends of the bars of the axino and gravitino correspond 
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FIG. 3: Some proposed particles in the interaction cross sec- 
tion versus the corresponding particle mass m; plane. The 
skeleton is taken from Ref. . The dashed curves repre- 
sent schematic shapes of Q; versus the corresponding particle 
mass m;. The small red square-box corresponds to the hot 
DM hadronic axion. Two small outside squares (in the cyan 
and blue colors) in the axion region are marked to show the 
plausible (GUT and CDM) axions, respectively. The abun- 
dances of heavy axino, gravitino and wimpzilla depend on 
how inflation ends. 


to the left crossing points of Fig. Da). The rest, except 
the axion, corresponds more or less to the right cross- 
ing points of Fig. Dfa) with the reheating after inflation 
considered if necessary. Currently, there are experimen- 
tal efforts to discover the LSP as predicted by SUSY 
models. Direct cosmological searches are also ongoing 
[1 full B3 4d, Bil Ead. Rod 25d, Jungman, Kamionkowski, 
Griest (1996), Bertone, Hooper, Silk (2005), Bernabei et 
al. (DAMA collaboration, 2003, 2008), Lee et al. (KIMS 
Collaboration, 2007), Angle et al. (XENON Collabora- 
tion, 2008), Behnke et al. (COUPP Collaboration, 2008), 
Ahmed et al. (CDMS Collaboration, 2008)]. At the 
LHC, the probable LSP mass ranges will be looked for 
by the neutralino decay producing the LSP. 


It is known that density perturbations began growing 
much earlier than the time of recombination in order to 
become large enough to form galaxies in the young uni- 
verse. For galaxy formation, therefore, DM is needed 
since proton density perturbations could not grow before 
the recombination time, but DM perturbations could. 
With DM, the equality point of radiation and matter 
energy densities can occur much earlier than the recom- 
bination time since DM is not prohibited in collapsing 
by Silk damping B38, Silk (1968)]. If the WIMP mass 
and interaction cross section fall in the region allowed by 
Eq. (I), it can be CDM. If the LSP is the only CDM 


component, then the LSP mass would give one number 
for the DM density, which may not be the case. Thus, 
even if the LSP is contributing to the CDM density, we 
may need the axion to account for the correct amount 
of CDM around us. This is possible in the anthropic 
scenario of very light axions because it is equally prob- 
able for the initial axion misalignment angle 4; to take 
any value between 0 and 7 , Tegmark, Aguirre, Rees, 
Wilczek (2006)]. 


Here we review the axion, which is the 
most interesting Nambu-Goldstone boson isd bd Bad 
Nambu (1960), Nambu, Jona-Lasinio (1961), Goldstone 
(1961)], as well as related issues. In Sect. [I we dis- 
cuss the strong CP problem and its plausible solutions. 
In Sect. [III] we review the most attractive solution giv- 
ing the very light axion and present the axion theory in 
terms of possible axion couplings defined with c1, co, and 
c3 which will be used throughout this review. In Sect. 
[V] we present axion astrophysics and cosmology. Here 
we present a new number for the cosmic axion abun- 
dance in view of the recent more accurate data on light 
quark masses. In Sect. [V] we summarize the axion detec- 
tion ideas and the ongoing axion detection experiments. 
In Sect. [VIl we summarize the proposed very light axion 
models, including superstring axions. Finally in Sect. 
[VIT we briefly discuss cosmology with the axino, the ax- 
ion's superpartner. 


If the axion was observed, it would mark one of the 
most profound elementary particle discoveries because it 
would confirm experimentally the instanton-based argu- 
ments of QCD. In addition, if it were shown to be consis- 
tent with a cosmologically significant amount of axions, 
the CDM idea of the bosonic collective-motion would also 
be confirmed experimentally. If SUSY is correct and the 
axion is the solution to the strong CP problem, it must 
affect the evolution of the universe as well. 


II. THE STRONG CP PROBLEM AND 
SOLUTIONS 


There are good reviews on the strong CP problem 
[rà 227 293, Kim (1987), Cheng (1988), Peccei (1989)], 
and here we outline a few key points. QCD with SU(3). 
gluons is a confining gauge theory with three light quarks 
below 1 GeV and Agcp = 380 + 60 MeV , Groote, 
Kórner, Schilcher, Nasrallah (1998)]. The classical gluon 


field equations have the instanton solution |30, Belavin, 
Ployakov, Schwartz, Tyupkin (1975)], 
1 r 
Gu = if (9a). f')- m. 0) 


where the gauge coupling is absorbed in the gauge field 
and g(x) is a pure gauge form with G,, « 1/r* for a 
large r and p is the instanton size. The (anti-) instanton 
solution satisfies the (anti-) selfduality condition Guy = 


+G uv Which carries the integer Pontryagin index 
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where Ger” = aad Ce The classical solutions with 
q = —~,:::,-1,0,+1,---,+00, introduces a new real 


number 0 which parametrizes the |0} vacuum, 
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NEC (4) 
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Since ns are integers, in view of Eq. (3), 0 is a periodic 
variable with period 27. It has been known that @ is 
an observable parameter 53, 207 bod Callan, Dashen, Gross 
(1976), Jackiw, Rebbi (1976)]. In the 0 vacuum, we must 
consider the P and 7 (or CP) violating interaction 
parametrized by 0 = 09 + Osca! 


= {GG} = 5 


zH nm Gn „Go (5) 
where the curly bracket carries 1/3277, 09 is the angle 
given above the electroweak scale and @weak is the value 
introduced by the electroweak CP violation. This ob- 
servable 0 has led to the so-called strong CP problem 
from the upper bound on the NEDM. For QCD to be- 
come a correct theory, this CP violation by QCD must 
be sufficiently suppressed. 


A. Neutron electric dipole moment 


The interaction (B) is the anomaly term B 7, Adler 
(1969), Bell, Jackiw (1969)] which is the basis for solv- 
asus 't Hooft (1986)] the old U(1) problem of QCD 
358 Weinberg (1975). The important size of instan- 
tons for physics is near the scale where QCD becomes 
strong. In [4d ** Hooft (1976)], 't Hooft has shown 
that the determinental interaction of light quarks carries 
the same global symmetry as that of Eq. @), and it is 
customary to use this light quark determinental interac- 
tion rather than treating the gluon interaction (5). The 
early estimates of NEDM proportional to 0 from the de- 
terminental interaction are 2.7 x 10-190 ecm [23, Baluni 
(1979)] and 3.6 x 107198 ecm [L18, Crewther, Di Vecchia, 
Veneziano, Witten (1979, 1980(E))]. Other estimates in 
different methods are 11 x 10-7190 ecm Cea, Nar- 
dulli (1984)], 1.2 x 10-198 ecm B28 Schnitzer (1984)], 
3 x 107160 ecm nd Musakhanov, Israilov (1984)], and 
5.5 x 107168 ecm [212, Kanaya, ids sam 
prehensive M on the NEDM exist us Dar 
(2000), Pospelov, Ritz (2005)]. Recently, the NEDM has 


1 With the canonical normalization of the gauge field, the RHS of 
Eq. is multiplied by g2. 


ae wa P Ed 
4 ^ 4, ^ 
I LI 4 LI 
——————— a. ss 
1 
x 
(19, 17) 
(a) (b) 


FIG. 4: The insertion of the CP violation effect by VEVs 
of 7° and 7 in (a). They can be transferred to one vertex 
shown as a bullet in (b). With this bullet, the CP violation 
is present by a mismatch between the CP conserving RHS 
vertex and C7 violating LHS vertex. 


qf Nen 
4, ^ 
I Y A 
———$——————————————— 
n p n d 


(a) (b) 


FIG. 5: Diagrams contributing to the NEDM with the bul- 
let representing the CP violation effect. Diagram (a) is the 
physically observable contribution. 


been estimated in the hard wall AdS/QCD model with 
one extra dimension, 1.08 x 107190 ecm Hong, Kim, 
Siwach, Yee (2007)]. 

The diagrams contributing to the NEDM are re- 
stricted. The neutron magnetic dipole moment arises 
at one loop in the chiral perturbation theory. If we 
treat this neutron magnetic dipole moment operator 
Hanom o" " n.F?7 as a vertex, tree diagrams do not con- 
tribute to the NEDM, because the magnetic moment 
term has the same chiral transformation property as that 
of the mass term and hence by redefining an external neu- 
tron field one can remove the phases in the neutron mass 
and in the dipole moment operator together. 

Let the U(1) chiral transformation of quarks in the 
broken phase be encoded in the neutron mass term as 
maf Leo / fa —osn | fa 0/2) s 4 h.c. (ei^? instead of 
eia" because the baryon octet is spin-4). The vacuum 
expectation values(VEVs) of 7° and r/ are calculated in 
Subsec. IIIB] The CP violation is present by a mismatch 
between the C conserving right-hand side(RHS) vertex 
and CP violating left-hand side(LHS) vertex as shown in 
Fig. Wb). The mass term of Fig. Hf b) and the neutron 
magnetic dipole moment term of Fig. DI b) have the same 
chiral transformation property and the phases appearing 
there can be simultaneously removed by redefining ng, 
for example. However, the phase appearing in Fig. Ela) 
cannot be removed by this phase redefinition and this 
contribution is physically observable. Since Fig. Ka) 
is the physically observable NEDM, for proton a similar 
argument leads to the same magnitude and opposite sign 


for the proton electrin dipole moment, i.e. dn + dp = 0. 
Now, we estimate the NEDM as 
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where the CP violating scalar coupling gz ww (the bullet 
of Fig. BXa)) is estimated in Ref. [L15, Crewther, Di 
Vecchia, Veneziano, Witten (1980(E))] as 


= 2(mg — mx)m,ma = 
pm = — 9 ——————————M—— a —0.0230 
IANN fr(Mu + ma)(2m, — m, — Ma) 
(7) 
where we take Z = m, /ma © 0.48, mg ~ 4.9 MeV and 
ms/mq ~ 20.1. From Eq. (8) of Subsec. DIBI we 


estimate the CP violating scalar coupling as 
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Note that Eqs. (7) and @ give a factor of ~10 differ- 
ence. The existing calculations vary within a factor of 10 
ba, [198.. 'These old calculations depend on the various 
approximation methods used, but non of these estimated 
the VEV of 7°. For example, for Eq. (7) Eq. (11) 
of Ref. [13 Crewther, Di Vecchia, Veneziano, Witten 
(1979)] uses the SU(3) symmetric baryon octet coupling 
due to the CP violating interaction. On the other hand, 
for Eq. (E) the ground state vacuum of the mesonic fields 
has been used. After integrating out baryons, we look for 
the vacuum below the chiral symmetry scale. Then, the 
correct vacuum choice adds the value (8) to the value (7). 
But here we choose the one-order-larger value from the 
mesonic vacuum shift value (B) for an order of magnitude 
estimate, not to worry about the signs of the contribu- 
tions. So, we estimate the NEDM as 4.5 x 107150 ecm 


from Eq. 8). 


Since the recent upper bound on the NEDM is |d;,| < 
2.9x 107 ?6ecm PJ, Baker et. al. (2006)], we must require 


[0| < 0.7 x 107". (9) 


This extremely small upper bound on @ has led to the 
so-called ‘strong CP problem’. |0| < 1071 is perfectly 
allowed but its tiny value is not explained given that it 
could have chosen a value anywhere between 0 and ~ 7. 
The strong CP problem is the quest to understand more 
satisfactorily, “Why 0 is so unnaturally small?". 


B. Possible solutions 


In the remainder of this paper, we simplify the nota- 
tion, replacing 0 by 0 since there will not be much con- 
fusion. There are three explanations for the smallness of 


0 in the naturalness framework: 


Case 1. Calculable 6, 
Case 2. Massless up quark, 
Case 3. Axion. 


In this subsection, we discuss Cases 1 and 2 briefly, and 
concentrate on Case 3 in the subsequent sections. 


1. Calculable 0 


Naturalness of a theory with a parameter (3 is elegantly 
defined by ’t Hooft in , t Hooft (1979)]: The theory 
is natural if the symmetry of the theory increases in the 
limit of the vanishing 8. A frequently quoted example 
is the Dirac fermion mass, mur Ug + h.c., where m — 0 
introduces a chiral symmetry v — e'?54j in the theory. 

Regarding the strong CP problem, the appropriate 
symmetry is parity P or CP since the interaction (B) 
violates parity P, time reversal 7 and CP, but con- 
serves charge conjugation C. Requiring CP invariance 
in the Lagrangian is equivalent to setting 0o at zero. 
However, the observed weak interaction phenomena ex- 
hibit the weak CP symmetry violations in the neutral 
K meson system and B — K *«- decay ld. Amsler et al. 
(2008)], and hence the needed introduction of CP viola- 
tion in weak interactions with 09 = 0 must be achieved 
spontaneously. In this process one necessarily introduces 
Oweak part in 0 which can be calculated and required to 
be sufficiently small within the bound given in (9). Along 
this line, many ideas were proposed (2,134, | , Beg, 
Tsao (1978), Mohapatra, Senjanovic (1978), Segre, Wel- 
don (1979), Barr, Langacker (1979)]. This naturalness 
idea may be extended so as to only effect renormalizable 
couplings [165, Georgi (1978)]. In any case, the introduc- 
tion of weak CP violation by spontaneous mechanisms 
Lee (1973)] or by soft scalar masses Georgi 
(1978)] must be checked with various weak phenomena. 
The current weak CP violation data fits nicely with the 
Kobayashi-Maskawa type CP violation Kobayashi, 
Maskawa (1973)], and these drastically different sponta- 
neous weak CP violation ideas are probably difficult to 
fit the data but are not considered ruled out yet 
He (2007)] even though the spontaneous CP violation 
scheme ia Branco (1980)] in the Weinberg model 
Weinberg (1976)] is ruled out bg, Chang, He, McKellar 
(2001)]. It should be noted, though, that the above pro- 
posed models have the difficulty in satisfying the bounds 


However, the Nelson-Barr type weak CP violation 
is mimics the Kobayashi-Maskawa type CP violation 
even though the fundamental reason for CP violation is 


spontaneous [28 Nelson (1984), Barr (1984)]. The 
scheme is designed such that the Yukawa couplings are 
real, ie. 09 = 0 from the CP invariance. Next, the 


spontaneous CP violation is introduced through the sin- 
glet VEVs, which is the key difference from the previous 


calculable models. Thus, the spontaneous CP violation 
is required to occur much above the weak scale through 
the singlet VEVs, mediating it to light quarks through 
mixing with vector-like heavy quarks. In modern terms, 
the heavy quarks can be considered as the mediation sec- 
tor. Then, integrating out heavy fields, we obtain the SM 
quarks with the Kobayashi-Maskawa type weak CP vi- 
olation. To ensure Arg.Det.M, = 0 at tree level, the 
specific forms for the Higgs couplings to the SM quarks 
and the superheavy vectorlike quarks are needed. Be- 
yond the tree level, however 0 is generated at one loop, 
typically with the form Ed , Bento, Branco, Parada 
(1991), Goffin, Segre, Welson (1980)], 


lon 
Ü eai © 15:244 S "(loop integrals) (10) 


where Af? is the product of couplings and the Feynman 
loop integral is of O(1). To satisfy the bound (Q), the 
small coupling Af? is needed. Some mechanism such as 
the family symmetry may be needed to forbid Oweax at 
one loop kd bed Chang, Keung (2004), Nelson (1984)]. 

This kind of Nelson-Barr type calculable Oya, can be 
mimicked in many extra dimensional models including 
superstring. Recently, for example weak is calculated to 
be O(10- 2) from two loop level in a sequestered flavor 
and CP model [d Cheung, Fitzpatrick, Randall (2008)]. 

Strictly speaking, the axion models also belong to the 
calculable models but we separate it from the models 
with spontaneous CP violation because there it is not 
needed to set ĝo = 0. 


2. Massless up quark 


Suppose that we chiral-transform a quark as q — 
e^5* 3. Then, the QCD Lagrangian changes as 


/ ea dm - egeo 
J d's[-m4g? "^q — (0 —20)fg2GG}] — (11) 


where {GG} = (1/641”)e"”?7G*,G%,. If mq = 0, it is 
equivalent to changing 0 — 0 — 2a. Thus, there exists a 
shift symmetry 0 — 0 — 2a. It is known that the tun- 
neling amplitude due to instanton solutions with a zero 
mass quark vanishes [34d, 't Hooft (1976)], which im- 
plies that the shift symmetry is an exact symmetry. In 
this case, 0 is not physical, and hence there is no strong 
CP problem if the lightest quark (i. e. the up quark) 
is massless. The question for the massless up quark so- 
lution is, “Is the massless up quark phenomenologically 
viable?” Weinberg's famous up/down quark mass ratio, 
Z=™,/Ma, gave Z = 5/9 , Weinberg (1979)]. It is 
very similar to the recent compilation of the light quark 
masses, m, = 2.6779 MeV,ma = 4.9414 MeV, and 
Z = 0.48* 13 which is shown in Fig. This compi- 
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FIG. 6: The allowed m, — ma region peg. Two downward 
sloping lines are from the bound on (mu+ma)/2 and two ris- 
ing lines are from the bound on my /ma, which are determined 
by the masses of the meson octet. Two vertical and horizon- 
tal boundaries are from the Particle Data Book bounds on 
m, = [1.5, 3.3] MeV and ma = [3.5, 6.0] MeV f. 


lation is convincing enough to rule out the massless up 
quark possibility , Kaplan, Manohar (1986)]. In this 
review, we will use Z — 0.48 when a number is needed 
though the appropriate bound may be 0.35 « Z « 0.60 
bà , Buckley, Murayama (2007), Manohar, Sachra- 
jda (2008)]. 

For some time the massless up quark possibility was 
taken seriously Pid, Kaplan, Manohar (1986). The 
reason is that even if the Lagrangian mass for the up 
quark is zero, the ’t Hooft determinental interaction may 
generate a useful up quark mass for the chiral pertur- 
bation. There was a confusion on this issue for some 
time [80,262 Leutwyler (1990), Choi (1992)]. Now, it is 
clear that the massless up quark possibility is ruled out, 
even without using the lattice calculation of the ratio, 
my,/mq = 0.410 + 0.036 P86, Nelson, Fleming, Kilcup 
(2002)]. 


III. AXIONS 


The axion solution seems to be the most attractive 
one among three possible strong CP solutions, in partic- 
ular at present when the massless up quark possibility is 
excluded and calculable solutions need one loop suppres- 
sions. 


Peccei and Quinn tried to mimic the symmetry 0 — 
0 — 2a of the massless quark case of by considering 
the full electroweak theory nde, r | 93, Peccei, 
Quinn (1977)]. They found such a symmetry if H,, is 
coupled only to up-type quarks and Hg couples only to 
down-type quarks, 


+ (h.c.) - 01GG). 

(12) 
Certainly, if we assign the same global charge under the 
^s transformation to H, and Hg, q — e'%%q,Hy, — 
et? Hu, Ha — e? Ha, the flavor independent part changes 
to 


L = —drunH, — dpdnHa — V (Ha, Ha) 


£—-— gre ure’ Hy, = dne dget Ha 


— V (e? Hu, e Ha) + (h.c.) — (0 — 2a) {GG}. (13) 


Since the full Lagrangian must possess the global sym- 
metry, the potential V should not allow the (H, H4)? 
term. Choosing 8 = a achieves the same kind of 0 shift 
of the massless quark case, which is called the PQ global 
symmetry U(1)pa. Unlike the massless up quark case, 
here 0 is physical. Even though the coefficient of {GG} 
changes in the same way in Eqs. (II) and (13), these two 
cases differ in that the tunneling amplitude vanishes with 
a massless quark (for which a detailed discussion will be 
presented in Subsec. [[ITB] and does not vanish without a 
massless quark. The reason is that the Higgs fields trans- 
form under U(1)pa, and one of the Higgs fields, called 
‘axion’ a, has the shift symmetry a — a + (constant) 
and corresponds to the Goldstone boson of the spon- 
taneously broken U(1)pa Bed, 367, Weinberg (1978), 
Wilczek (1978)]. So we call the resulting axion from Eq. 
(13) the Peccei-Quinn-Weinberg-Wilezek (PQWW) ax- 
ion. If the consequence of the determinental interaction 
is only Fig. [I] then out of two bosons 7 and a, only 
1 obtains mass by the RHS diagram of Fig. [J and a 
remains massless. If a remains massless, the strong CP 
problem is solved as envisioned in [294 Peccei, Quinn 
(1977)] since for any 0 we can choose the VEV (a) such 
that the final 0 is zero. This was Peccei and Quinn's 
idea that (a) has a shift symmetry mimicking the mass- 
less quark case. However, a has interactions and can be 
produced in the stars and K meson decays, which differs 
from the massless quark case. 


At the classical Lagrangian level, there seems to be no 
strong CP problem. But, the axion coupling to {GG} is 
generated at one loop level, which is the U(1)pog-QCD- 
QCD anomaly. The ’t Hooft determinental interaction 
we mentioned above is exactly this anomalous coupling. 
With this one loop term, the Lagrangian is not invariant 
under the phase shift symmetry f, or a — a+ (constant). 
Since it is explicitly broken at one loop level, the phase 
field @ of the Higgs fields, or axion a does not have 
a flat potential, ie. Fig. [I is not complete. Wein- 
berg and Wilczek interpreted this phenomenon using the 
spontaneous symmetry breaking of the global symme- 
try U(1)pq. It is said that 6 is made dynamical where 


0 = a/ F,, but in the PQWW axion case the component 
was there from the beginning in the phases of the Higgs 
doublet fields. The free energy depending on — cos is 
the potential for the axion. Since it is proportional to 
— cos0, the minimum of the potential is at 0 = 0 in 
CP conserving theories , Vafa, Witten (1984)], and 
thus the vacuum chooses 0 — 0. We must discuss this 
effect below the chiral symmetry breaking scale as will 
be discussed in Subsec. [JLB] Thus, the axion solution 
of the strong CP problem is a kind of cosmological solu- 
tion. Note however that the weak C7? violation shifts 0 a 
little bit, leading to 0 ~ O(10-17) , Georgi, Randall 
(1986)]. 


The PQWW axion is ruled out quickly 147, BoJ, Don- 
nely et. al. (1978), Peccei (1978)], which was the reason 
for the popularity of calculable models in 1978 as dis- 
cussed in Subsubsec. IBI] Nowadays, cosmologically 
considered axions are very light, which arises from the 
phase of SU(2)xU(1) singlet scalar field c. The simplest 
case is the Kim-Shifman-Vainstein-Zakharov (KSVZ) ax- 
ion model | 831, Kim (1979), Shifman, Vainstein, 
Zhakharov (1980)] which incorporates a heavy quark Q 
with the following coupling and the resulting chiral sym- 
metry 


L=-Q1Qro + (h.c.) — V(\o|?) —O{FF}, (14) 
b= Ope Qro + os — V (lol?) 
— (0 — 2a){GG}. (15) 


Here, Higgs doublets are neutral under U(1)pg. By cou- 
pling o to H, and H4, one can introduce a PQ symme- 
try also, not introducing heavy quarks necessarily, and 
the resulting axion is called the Dine-Fischler-Srednicki- 
Zhitnitskii (DFSZ) axion [135, B79, Dine, Fischler, Sred- 
nicki (1981), Zhitnitskii (1980)]. In string models, most 
probably both heavy quarks and Higgs doublets con- 
tribute to the c field couplings. The VEV of e is much 
above the electroweak scale and the axion is a very light 
azion.? The SU(2)xU(1) singlet o field may mix with 
the Higgs doublet component by a tiny amount, whence 
practically we can consider the axion as the phase of a 
singlet field c, o = [(v-- p)/ V2]e/*//* with the identifica- 
tion a = a+2rNpw Fa with the axion period 27 Npw Fa. 
Note that we use fs for the VEV of ø or the value rel- 
evant in the field space and Fa defined from the coef- 
ficient of the anomaly term. Namely, the coefficient of 
the anomaly {GG} defines Fa as 0 = a/F, while the 
VEV(v) of o,o œ e'*/", defines fs. The periodicity 27 
of 0 implies that Fa cannot be larger than v = fs, and 
we have Fa = fs/Npw. It has been shown that mod- 
els with Now # 1 has an energy crisis problem in the 
standard Big Bang cosmology , Sikivie (1982)]. But 


? Once it was called an invisible axion sd, Bea, Wise, Georgi, 
Glashow (1981), Nilles, Raby (1981)] but is better to be called a 
very light axion due to the possibility of its detection. 


models with Npw = 1 do not have such a problem due 
to the mechanism to convert the two dimensional axionic 


domain wall disks surrounded by the axionic strings into 
radiation Bi, Barr, Choi, Kim (1987)]. 


A. Axion shift symmetry and reparametrization 
invariance 


In the original PQWW axion model, the Lagrangian 
in the effective field theory language has been extensively 
discussed | E| Donnelly, Freedman, Lytel, Peccei, 
Schwartz (1978), Peccei (1989)]. Here, due to the sim- 
plicity in the formulae, we present the variant-type axion 
models where the PQ charges are assigned only to the 
right-handed quark fields (25, Bardeen, Peccei, Yanagida 
(1987)]. This discussion will make it easier in introduc- 
ing our general formulae below. The PQ current is p3, 
Bardeen, Peccei, Yanagida (1987)], 


Ng N 
Ip? =FyOua+ 2 V dgiyudni + (1/2) >> tre yun 
i=1 i=l 
Ng 
+ (—2) 5 URiYuURi (16) 
i=N+1 


where N, is the number of families, N is the number 
of up-type quarks coupled to H,, and x = (H,)/(H,). 
'The color anomaly is non-vanishing, i.e. the divergence 
of JP 9 is 


1 1. a, 
aN pried = zye F Di 


4mauü|[i»ysei*15/ Fo + mad[i-ys e/*57/ F«]q (17) 
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where we considered the one family model of u and d 
with N — 1. If N were zero, there is no color anomaly. 
For a nonvanishing N, we have to pick up the compo- 
nent orthogonal to the longitudinal Z,,. Since the axial- 
vector part of Z,, current is proportional to J hee any ax- 
ial U(1) current orthogonal to the longitudinal Z,, is an 
SU(2)aavor singlet current constructed in terms of right- 
handed quark fields. These include the currents corre- 
sponding to both 7’ and the PQ phase. Since r/ is known 
to be heavy, we integrate out 7 to obtain light fields 
below the chiral symmetry breaking scale. This corre- 
sponds to picking up an anomaly-free piece, orthogonal 
to the longitudinal Z,,. It is 


1 
14+2Z 


(yu ysu + Zdy,'ysd) 
(18) 
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where Z = m,/maq. The divergence of is propor- 
tional to muma which must be the case for the particle 
orthogonal to 17’. 

Below we use the typical axion model (14) because 
of its simplicity in assigning the PQ charges whenever 


an explicit example is needed. It has the following 


U(1)pq charges, I, 


Field oO Qr Qn 
T 1 +4 -j 


In this example, the axial-vector current for U(1)pQ is 
de = Qu + vô a where a is the phase field 


of o = (v/V2)e'*/", The current corresponds to 
the charge flow which satisfies the current conserva- 
tion equation if the symmetry is exact. But the 
axial-vector current is in general violated at one loop 
by the anomaly a Ba Adler (1969), Bell, Jackiw 
(1969)], aH JP = (Ngg?/32n? )cs Ges. or a = 
(Ngg2/327? “Gt, G^" + (mg/v)QiysQ with the Q 
number No, which shows that the axion interaction with 
the SM fields is only the anomaly term (plus the anoma- 
lous coupling with the SM gauge fields). Here and in Eq. 
(10), we explicitly write the QCD coupling g?, but in the 
remainder of the paper we absorb the gauge coupling in 
the gauge fields except in the experimental section, Sec. 
[V] This axion is the one settling 0 at zero; thus one needs 
the axion-gluon-gluon anomalous coupling for which the 
color anomaly of J p should exist. This kind of symmetry 
T is called the PQ symmetry. 

Axions are introduced as the Goldstone boson degree 
of a spontaneously broken global U(1)pq symmetry in 
renormalizable gauge models [o2d, Peccei, Quinn 
(1977), Kim (1985)] and/or as a pseudoscalar degree in 
a more fundamental theory where the axion interaction 
arises as a nonrenormalizable anomalous interaction in 
the effective low energy theory. The most compelling 
nonrenormalizable interaction was observed in the com- 
pactification of 10-dimensional superstring models 
Witten (1984). Below, we will treat that the axion is 
present as a dynamical degree at the electroweak scale 
whether it arises from the spontaneously broken PQ sym- 
metry or from a more fundamental theory with a non- 
renormalizable anomalous coupling, and focus on QCD 
interactions containing the axion degree, a = 0F,. Then, 
let us collectively write the most general form of its inter- 
actions: the cı term is the derivative coupling respecting 
the PQ shift symmetry, the c9 term is the phase in the 
quark mass matrix, and the c3 term is the anomalous 
coupling or the determinental interaction ZCaet, 


=- | 13000, j= mi GW + (Lila + grian) 
+ a 0)q" sq — 3 m qne? + h.c.) 


— GG (or Lact) (19) 


Cv 55,2 327 2 Fem, mo F Lieptons,ð 

where 6 = a/fs with the axion decay constant fs up to 
the domain wall number (fs = Npw Fa), q is the fermion 
matrix composed of SU(3)e charge carrying fields. When 


the singlet scalar fields are easier to discuss, we use fs, 
and when the anomaly term is easier to discuss, we use 
Fy. Lieptons,o is the axion interaction with leptons. ci, c», 
and c3 are pre-given coupling constants below the ax- 
ion scale fs with the mass parameter m defined to be 
real and positive below the electroweak scale. Then, the 
determinental interaction can be used instead of the c3 
term, 


—ic30 


Laet = —27licg@(—1)%5 Det(gqgr)-r h.c. (20) 


K3N;-4 
where we multiplied the overall interaction by 0 in the 
small 0 region and require the periodicity condition, 
c30 = c30 + 2m. The periodicity can be accomodated 
automatically if we replace —2-!ic40 by 1, but then we 
must add a constant so that it vanishes at 0 = 0. The sign 
is chosen such that the potential is a minimum at 0 — 0 
Bsa Vafa, Witten (1984)]. With the fixed phases, the c3 
term is given from the QCD vacuum structure (@) which 
does not have any dimensional coupling. But the instan- 
ton physics necessarily introduces the instanton sizes and 
hence a kind of QCD scale K for the interaction respect- 
ing the chiral transformation property for a flavor singlet 
operator Lact. We will use either the anomaly term or 
Lact. The 6 dependence of the form is —c30 sin(c30) 
which has the parity symmetry 0 — —0. The Fourier 
expansion satisfying these is 

—2~1¢3 sin(c3@) = —27 [1 — cos(c3)] + 5 an cos(ncs0) 


n=2 


where the Fourier coefficients satisfy, 35^. , nan = dio. 
Neglecting the n > 2 terms, we will use just the cos(c39) 
dependence. 

In the defining phase Eq. (19), the PQWW axion is 
given by c1 = 0,co Æ 0, and c3 = 0, the KSVZ axion 
by cy = 0,cg = 0, and c3 Æ 0, the model-independent 
axion Bi Witten (1984)| in superstring models by 
cy = 0,co = 0, and c3 4 0, and the DFSZ axion by 
€i = 0,cg Æ 0, and c3 = 0. In general, axion models 
from high energy will have co Z 0, and c3 4 0, and the 
shift symmetry allows cı z 0 in a different basis. For 
simplicity, we discuss Eq. for one flavor QCD first. 
For N; flavors, both c; and 0 are defined from Ny x Ny; 
matrices in addition to the anomalous coupling and hence 
the axion is included in Tr which also contains the 1’ 
meson part of QCD. For N; flavors, c;0 must be replaced 
by 'Trc;0. For the following illustrative discussion, we re- 
fer to one flavor QCD, but in Subsec. [IILB]in the axion 
mass estimation we present the full Ny flavor QCD result 
with the chiral symmetry breaking taken into account. 

For the case of axion mass, c1, c9 and c3 terms may be 
relevant, but only the combination c2 + c3 appears. This 
Lagrangian has a shift symmetry a — a+ (constant), 
which reparametrizes the couplings between c;,co, and 
c3. Explicitly, the axion field dependent changes of the 
quark fields qr, — ez, and qr — e7***™ap give 
Cy — C1 — Q, C2 — C2 — 2a, C3 — C3 + 2a, and it must give 


the same physics, i.e. mes 227, Kim (1987), Georgi, 
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FIG. 7: The case with Npw = 3 where three vacua are dis- 
tinguished. 


The reparametrization symmetry dictates the non- 
derivative couplings satisfying c2+c3 = (constant), which 
is one of the reasons that we use 0 = Ügrp + focn = 
Oo + weak aS a physical parameter in axion models. Usu- 
ally, transferring all couplings of axion to the coefficient 
of GG, the axion decay constant Fa and 0 are defined. 
Instead, if we use fs (defined to be the VEV of the sin- 
glet Higgs field e), there exists the coefficient c3 defined 
in Eq. (9). The triangle diagrams may give an integer 
times 0 and the instanton potential comes to the original 
value by a 0 shift of 2r/(c2 + c3), with c2 + c3 = Npw 
not necessarily 1 in the pseudoscalar field space. Thus, 
this integer is called the domain wall number Npw 
Sikivie (1982)] 


Npw = [ca E ca] = TrT ( feolorea) (J colored) (22) 


where the trace is taken over all heavy and light quarks 
and £ is the index of SU(3)« representation of colored 
fermions and the PQ charge is given for the left-handed 
chiral representations. The height of the potential is 
O(Agcp) of the nonabelian gauge interaction, which is 
shown in Fig. [J with the domain wall number Npw = 3: 
the bullet, the square and the triangle denote different 
vacua. Two important properties of axions in CP con- 
serving theories are: 


(i) periodic potential with the period 27 F, where Fa 
is defined in (I9) with Fa = fs/Npw, and 


(ii the minima are at a = 0, 2x F,, AnF,,---. 


''his determines the cosine form of the potential. There 
exist the axion mixing with quark condensations as we 
will discuss it in more detail later. 

The derivative coupling, i.e. the cı term, can never 
contribute to the PQ symmetry breaking effect, espe- 
cially to the axion mass. This axion gets its mass from 
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Tomaras, Pais (1981)], 


Q, C2 — 20, C3 d 2a, m, Aoc]. (21) 


m 


the 0 anomaly term which breaks the PQ symmetry. The 
global symmetry is not broken by the derivative term 
which therefore cannot contribute to the axion mass. 
From the reparametrization invariance (ZIJ, the combi- 
nation c9 + c3 is the correct combination for the axion 
mass, which is shown explicitly below. This derivation 
is included with a more complicated expression in the 
SUSY extension, but we show the co + c3 dependence in 
this supergravity framework because it is the underlying 
symmetry in many axion models. Some of the following 
discussion is derived from [93, Choi, Kim, Nilles (2007)]. 


Supersymmetrization 


We will discuss the reparametrization invariance with 
the SUSY generalization. In the M = 1 SUSY mod- 
els with chiral fields z, there are the superpotential W (z) 
and the gauge kinetic function f(z) both of which are the 
holomorphic functions of z. The superpotential gives the 
c9 term and the gauge kinetic function gives the anomaly 
term c3. The PQ invariant Lagrangian, the cı part, has 
the shift symmetry under the shift of the axion super- 
multiplet: A — A + i(constant). This derivative cou- 
pling must appear from the D terms in SUSY models, 
ie. through the Kahler potential. The real Kahler po- 
tential K(z,z*) must respect the PQ symmetry in the 
form of A+ A, 


K = Ko[A +A] +{Z[4+ Alana the} — (23) 


where the 0? components of the fields are implied and 
‘q’s denotes quark supermultiplets, 


q = Yq tq (24) 


with the anticommuting variable ð. Here, we used 0 for 
the anticommuting Grassmann number since 0 in this 
review is reserved for the axion 0 = a/ Fa. 


B. Axion mass 


'The axion mass arises from the anomaly coupling OGG. 
In this subsection, first we show that only the c2 and ca 
couplings are the relevant ones for the axion mass, and 
then we present the axion mass in the broken phase of 
the chiral symmetry. With SUSY, the discussion is a 
bit tricky, because the axion remains massless due to the 
massless gluino (as in the massless up quark case with 


a spontaneously broken PQ symmetry). For the axion 
mass, therefore SUSY must be broken and here one has to 
look at all supergravity terms how they contribute to the 
axion mass. Nevertheless, we have the master formula 
(ZI) for the axion which must be valid even when SUSY 
is broken. In this regard, SUSY is not special for the 
axion mass but only the anomaly consideration is the 
chief constraint. Thus, the following discussion applies 
even without SUSY, but let us discuss the axion mass 
in detail with the SUSY generalization to include the 
gluino effects and hence the c4 type derivative couplings 
to matter (quarks) and gauginos (gluinos). 

We have noted that there exists the famous anomaly 
coupling of the 7’ meson which is the mechanism solving 


11 


the old U(1) problem of QCD. In addition to 7’, the axion 
a is introduced in the anomaly coupling and_hence one 
must consider the mixing of 7 and axion bd | 
Baluni (1979), Bardeen, Tye (1978), Kim, Kim (2006)]. 


The c3 term is the anomaly coupling of the ax- 
ion, and we normalize the anomaly as the coefficient 
of eP ey ,€08k1,k2g. With this normalization, from 
capy50^ APO? AP leading to —eo PY, vei gkay€26; the C3 
term anomaly is defined with A3 = 1. 


It can be shown that, using the Kahler potential 
the kinetic energy terms of fermions contain ! 
Cremmer, Ferrara, Girardello, van Próyen (1983), Nilles 
(1984)) 


E = 1 - = _ 
X Za (Gi + s Buys + Yaua) +5 (rior — ga) (25) 
v A 


where B, and Y} „ come from the auxilliary E we ob of real Ko and Z,, respectively. In terms of the real parts 


R and y of Ko and Z (redefined from Z, of (23)), we obtain 
| 4 ( OKo Fo Ó 
| i ( 0Ko OKo oln Za ðln Zaa =N _ Olny 
ge 2 (Sea. id aA ——0O e) ( OA — aA OA =2 EY! Opa (27) 


where 


-K 
G = —31n (=) + In |W?, K= —e7 hale. Zq = ef, 


Ky =R+ikor =R, Z,=V+il=). (28) 


The c3 term is an anomaly term. In addition to the c3 
term, the cı and c» couplings via loops of Fig. [E] will also 
generate the anomaly terms. The derivative coupling, if 
it has ever to contribute to the axion mass, should con- 
tribute to the axion mass by the anomaly through loops. 
In Fig. the couplings for the triangle diagrams are 
represented in terms of c; and co. In supergravity mod- 
els, we consider B,, and Y,,,, couplings which are nothing 
but cı. Consider a fermion with mass m. The derivative 
coupling through Fig. [S] has the anomaly coupling by 
picking up the coefficient of ce Eogki,kag , See, 
for example, Georgi, Tomaras, Pais(1981)], 
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FIG. 8: The Feynman diagrams for generating anomalous 
OGG couplings from cı for a fermion with mass m. For c», we 
replace ci*,^5 by comoys. 


Also, the quark mass term of Fig. B] gives 
1 1-2, 2m? 
Ag J de | drao, (80 
*' de do "m? — f(z1,22; q, kı, k2) 90 
From Eqs. and (B0), we construct 


Í " 1—21 
—A; + Ag a ax, | 2 dr = 1. (31) 
2 0 0 


When we calculate the axion mass in the real and pos- 
itive quark mass basis as usual, the anomaly coupling 
(a/Fa){GG} coupling (including the loop effect) is the 
sole source to the axion mass. In this basis, and also 
in any basis due to the reparametrization invariant com- 
bination co + cs, we do not have to discuss the deriva- 
tive couplings toward the axion mass. But, even though 


LLL 


Case (i): m «& Agcp :Tipr = 


Case (ii) Dmm Agcp :Tipr = 


Consider the quark mass term and the one flavor determi- 
nental interaction with the quark condensation, (qrqg) ~ 
Aone” f. 'Then, the potential takes the form 


V= m(gLqr) e? + h.c. + (c3 + cA + C2A2) {aa} : 

(34) 
For the anomaly combination ca + c1.A1 + C242, the 
reparametrization invariance Eq. (ZI) transforms c3 + 
cy A, 4- €2.À2 to c3 +204 (e a).A1 t (c2 2a) Ae = g+ 
c,A, + c2A2 where (21 is used, i.e. it is reparametriza- 
tion invariant. 


For Case (i), we consider the light quark below the 
scale dcp: Thus, we have 


/ 


i 
V = mv? cos (4 = a6) + Mop cos (o + 2c1)0 + z) 


for which we choose cı = 0. [If we kept c1, we must 


consider the kinetic mixing of a and »/.] Integrating out 
the heavy 1 field as T = —c37 from the Acn term 


which is the larger one, we obtain 


V = mv’? cos (e. + eM) 


from which we obtain 


[co + es] 


fs 
Quarks u,d, and s belong to this category. 


Ma ~ mAb (35) 


For Case (ii), the heavy quark does not condense and 
integrating out the heavy quark gives 


V = A4 cos (( +c x) 
QCD (c3 T 


167? 
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the derivative coupling generates the anomaly, it being 
derivative does not contribute to the axion mass. 


For one flavor QCD, we can check the above statement 
explicitly using Eqs. (299330351). In the following two 
limiting cases, the integrals are easily computed as, using 


Eqs. @9) and GO, 


167? 


2 
eP ky kag c + 2c, + vUa) (32) 

1 apbyd k? 
—— e ke kag | c3 + 02 + OC) (33) 


m 


from which the axion mass is given by 


[co + c3] 


fs 


Again the axion mass depends only on the combination 
co+c3. Heavy quarks above the chiral symmetry breaking 
scale c,b, and t give the c9 term and vectorlike heavy 
quarks above the electroweak scale give the c3 term when 
we wrote Eq. just below the electroweak scale. 


2 
Ma ~ AQcp 


Axion mass with light quarks 


In the real world, there exist three light quarks whose 
masses are much smaller than the QCD scale Agcp, and 
therefore the axion mass has the form anticipated in Eq. 
(85). Even though there are two light quarks the axion 
mass dependence has the form ym because of Fa > fr. 
'This is because of the way in picking up the leading term 
from the anomalous determinental interaction [223, Kim, 
Kim (2006)] as depicted in Fig. 

In fact, this is simply obtained just by noting that the 
instanton interaction is a U(1) singlet Kim (1987)]. 
Suppose we integrate out quark fields; then the quark 
mass parameters appear in the effective interaction as 
shown in the first diagram of Fig. In this vacuum in 
a theory with a massless quark, the tunneling amplitude 
vanishes so that the strength of the first diagram must be 
proportional to ma. With three quarks, we can generalize 
it as l/(mz! mq! -- m7). Suppose that there are only 
gluons and a very light axion a at low energy. Integrating 
out heavy fields, we are left with the flavor independent 
coupling aGG. Here we are not considering 7 even be- 
low the quark condensation scale. If quarks are added, 
the flavor singlet coupling aGG can be split to quark 
mass terms with a, X £/Mu, oq X z/mq, As X £/Ms, 
etc., as if quarks are not integrated over, m,ür uge'?"? + 
madrdge/^4? +--+, which shows that the flavor sin- 
glet coupling is of order O(a/F,). Then, even below 


gi( cd --03)0 


m, A? —V3 —v3 mal? 


= 
FIG. 9: The 't Hooft determinental interaction. e denotes the 
quark condensation and x denotes the insertion of the current 
quark mass. The diagram highlighted with yellow predomi- 
nantly contributes to 7’ mass, and O(muma) is neglected. 


the chiral symmetry breaking scale, we have the PQ 
charges proportional to 1/m4. With this definition of 
quark charges, the axion mass comes from integrating 
out GG, and is proportional to a, + aq + as which is 
N MuMams/[Ms(Mu + mq) + Muma], which was shown 
first for the PQWW axion bd Baluni (1979)]. This is 
true even in the heavy quark KSVZ type axion models. 
Even if the light quarks do not have the same PQ charge 
as in some variant axion models ba, [193 B34, E 
Krauss, Wilczek (1986), Peccei, Wu, Yanagida (1986), 
Bardeen, Peccei, Yanagida (1987), Kim, Lee (1989), 
Hindmarsh, Moulatsiotis (1997)], the axion mass has the 
same final form due to the reparametrization invariance, 
which will be shown below. So, the axion mass formula 
we write below is quite general. 
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However, there was an assumption in this statement: 
7’ was integrated out. So, it is necessary to include 7’ 
to obtain a more accurate axion mass. The light mesons 
and axion interactions must appear from those of Fig. 
In this framework, however, the flavor singlet condition 
must be invoked as a constraint. [This flavor singlet con- 
dition is the anomaly matching without r'.] Along the 
way, we would like to see how the ym dependence arises 
below the chiral symmetry breaking scale in the KSVZ 
model. 

In the presence of vectorlike heavy quarks, the heavy 
fields are integrated out, whose sole effect is encoded in 
the low energy effective theory as nonrenormalizable cou- 
plings suppressed by Fy, e.g. in the anomalous c3 type 
couplings with the SM gauge bosons. It is assumed that 
the heavy quark does not condense, since the QCD cou- 
pling is very small due to the asymptotic freedom at the 
heavy-quark Compton wave length scale and there does 
not exist a strong force to attract heavy quarks. Be- 
low the heavy quark scale, there is no massless mesons 
composed of heavy quarks. Therefore, the general form 
of the axion interaction Eq. ([9] is valid at low energy. 
Firstly, the determinental interaction has the same chiral 
symmetry behavior as that of the anomaly term, and the 
anomaly term is removed in favor of the determinental 
interaction to include 1 explicitly. Second, let us choose 
the basis where the u and d quark masses are real. Since 
the strange quark mass is known to be below the QCD 
scale, we must include the strange quark with real and 
positive mass also in the instanton interaction. For sim- 
plicity, 1? and 7, arising from quark condensations tu 
and dd with decay constants f; and f, (z fr) [164, Gell- 
Mann, Oakes, Renner (1968)], are considered explicitly 
but with the 7 meson frozen. The effects of heavy quarks 
are included in the c3 term. If we keep c1, the kinetic mix- 
ing of mesons and axion are present, due to the PCAC 
relation (0|J&(x)|meson;(k)) = —ik"f2e-'*"5;; where 
JE ~ q^y"*sTi;q. This would modify the axion mass, and 
hence it is easiest to calculate the axion mass by choosing 
the reparametrization parameter a such that cı = 0. In 
this basis, denoting 7°, 7’ and a in terms of dimensionless 
fields, 0, = 19/ fr, 0y = n'/ fy, 0 = a/ Fa, we obtain the 
following effective interaction below the chiral symmetry 
breaking scale, 


L= -Mu (ūpupje lerte) +36] = ma(dpdp)etl(—Ox+4n +29) + h.c. + Last (36) 


where Laet is given in Eq. 


Lig (-1) tK (arun) (ddr) (8, s gpye'O0v T9) + ...+ flavor singlet constraint) + h.c., (37) 


where K has the mass dimension arising from QCD 
instanton physics. The above form is consistent with 


the anomaly with cı = 0. Note that the log det 
form in the effective Lagrangian was used in fad [d 


B51 Bed. 70, Veneziano (1979), Di Vecchia, Veneziano 
(1980), Witten (1979, 1980), Di Vecchia, Nocodemi, Pet- 
torino, Veneziano (1981)] from the 1/N, expansion con- 
sideration, but we use (B) because of its simplicity in 
the diagramatic expansion. The sign of the first dia- 
gram inside the box in Fig. Mis determined to be mi- 
nus without the weak CP violation , Vafa, Witten 
(1984)]. The QCD vacuum with the flavor indepen- 
dence of light quarks without the determinental inter- 
action chooses mq (Gq) = —|mq|v? and we choose the sign 
of all quark masses be positive so that (qq) = (uu) = 
(dd) = —v® [2d fed, bz b53 le dom) bm 
gacker, Pagels (1973, 1979), Gasser, Leutwyler (1982)]. 
Eq. BJ is the instanton interaction of Fig. which 
gives Af, m, A?, MgA,- by many ways of closing quark 
lines, shown in Fig. D] but here one must invoke the fla- 
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vor singlet constraint. The dominant term is the second 
diagram highlighted as yellow, which is flavor singlet and 
is the main source for the 7 mass. 


Now let us restrict to the two flavor case. For the ax- 
ion, the key diagrams are those in the second line of Fig. 
If there are more than one QCD axion, then O(muma) 
diagram will be important at the next level axion mass. 
Integration over the instanton size includes very large 
instantons which covers the chiral symmetry breaking 
range where mesons appear as dynamical degrees, where 
we will invoke the flavor singlet constraint. The effec- 
tive interaction Hamiltonian of 6,,6, and 0 = a/ fs can 
be written, using the reparametrization invariance (ZI) 
with Ny = 3 and 7 fixed, as P23 Huang (1993), 
Kim, Kim (2006)], 


9 
-V = m? cos (0, + 0y) + mav? cos (C04 + Oy) + X cos (20, — (c + c3 + ex)0) 
A26 2,6 
+ Mu eR cos (—6; + Oy — (c3 + c$ + c3)0) + ma rs cos (On + Oy — (c3 + cd + c3)0), (38) 


where A,, and Ag are parameters describing the result 
of the Feynman and instanton size integrations. The 
(—1)F is cancelled by the fermion loop or (—v) factors. 
If m, = ma, A, and Ag are equal. For m, Æ mq, Au 
and A4 must be different. The instanton interaction is 
flavor independent, which should be respected in the in- 
teraction (28). The m, and ma linear terms from the de- 
terminental interaction should be flavor independent, i.e. 
Mu A2 + MaA? = flavor independent. Since it vanishes if 
one quark is massless, it must be a function of Muma. 


C^[A7, + 2uA3 


inst] 


pe 


M; -2c[A7, + HAhstl/ fF 


mun T 


0 


where c = ct + cd + c3, F = fs, f = fr, f' 
vê / K”? 


Ma — m4, and 


Fars AL = 


3 — [2,3/«2 = 
and Až a = L^v?/K*,m, = my + ma,m_ 


I 


Tha TI a 
= M. 40 
pe LIS (40) 


Certainly, Eq. (89) realizes the solution of the U(1) prob- 
lem due to the AL term in the (22) component. In the 


[AA a 2uN? ot + mv") / f^? 


-m/f 


m 


Thus, the instanton size integration with current quark 
masses must give M,A? +m4A? = 2m,maL? / (m, +ma), 
which vanishes if any quark is massless. This is because 
the original gluon anomaly term {GG} does not distin- 
guish flavors, and the smallness of the current quark 
masses enable us to expand the 't Hooft determinental in- 
teraction in terms of powers of the current quark masses. 
Then, the 3 x 3 mass matrix M? of a, and 7°, tak- 
ing into account of the chiral symmetry breaking and the 
solution of the U(1) problem is given as 


—2Qc[AF, s UM) / f F 0 


—m_v*/f f' (39) 


(mv? + Qu Ad ot)/F? 
m 


limit f/F, f'/F « 1, we obtain 


3 

MU + 2uAD. 

mmo = po (41) 
44$, s m.Lv? T 2uA s 

mj, e fnt ED (42) 

n’ 
2 g Z 2,2 
mix SH fimi (eA) (43) 


a ~ F? (1+ Z)? 


where 


m? Aa (m.Lv? sr HAT a 


m. mo fr 


A= (44) 


In this form, the 7 mass has the standard m., v? plus the 
instanton contribution to light quark masses El 
Kaplan, Manohar (1986), Choi, Kim, Sze (1988)]. From 
(41922), we estimate the parameter AT which is the 
source of the solution of the U(1) problem: Aj, = 
( m. — f2m?)/4 ~ (202 MeV)* with fy ~ 86 MeV 
and fr ~ 93 MeV. In any axion model, this form is valid 
with |c| = Npw. Using the standard definition on the 
axion decay constant F;, = F/c, we obtain 


EE fama 4A). (45) 


"e =F Fe 


So, even though the instanton diagrams of Fig. [9| con- 
tain the linear quark mass diagrams summed over, the 
diagonalization process with mesons signals the predom- 
inant contribution of the lightest quark. The flavor sin- 
glet condition we discussed before chooses the following 
linear quark mass dependence 


(sy. (46) 


Neglecting instanton contribution to the current quark 
masses, we obtain ma ~ 0.60 eV [10’GeV/F,], for the 
mass ratio Z ~ 0.48 as summarized in Manohar, 
Sachrajda (2008). An earlier frequently cited Z is 5/9 

; , Weinberg (1979), Gasser, Leutwyler (1982)]. 
The correct axion mass has to include the current quark 
mass change due to instantons. However, the resulting 
estimate of A turns out to be small. 


Comparison with old calculation 


Now, let us comment on the old anomaly matching 
condition. If any quark mass is zero, there exists an ex- 
act symmetry a — a+ (constant), i.e. axion is massless, 
above the chiral symmetry breaking scale. Below the 
chiral symmetry breaking scale, it is likely that this con- 
dition is satisfied. Let us denote the original current as 
Jbg This current is anomalous above the chiral symme- 
try breaking scale, ô Jbg = (NQ/32n?)G2,Ge"" where 
Ng is the number of heavy quarks with T = 1/2. Below 
the chiral symmetry breaking scale, we considered two 
pseudoscalar mesons which have anomalous couplings: 
7 and a. The global anomaly matching condition will 
work if there were no chiral symmetry breaking B41, t 
Hooft (1979)]. For chiral symmetry breaking, there is 
no massless fermions and we consider only color singlet 
mesons below the chiral symmetry breaking scale. Thus, 
the bosonic current must be anomaly free after integrat- 
ing out all heavy fields including 77’, i.e. we consider an 
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anomaly free current J/ instead of Jbg below the chiral 
symmetry breaking scale 227, Kim (1987)], 


No .. - 


EL 

Je = Jp 

where the divergence of the second current gives a singlet 

pseudoscalar density so that axion does not mix with 7°. 

The expression with A shows that the finite n’ mass 
enters into the a — 7 mixing. 


Mesons without axion 


Even if there is no axion, we can diagonalize the mass 
matrix. If m, = 0, one starts with an exact up quark 
chiral transformation, which would lead to a Goldstone 
boson 0 in the vacuum, (ŭu) #4 0. This Goldstone boson 
couples to neutron through (c18,0)ny”ysn. In reality, n 
obtains mass by the anomaly, and the symmetry remains 
unbroken: it is the phase symmetry of (uu). Therefore, 
any violation of the shift symmetry must be invoked such 
that it goes away in the limit jj — 0, which is Dashen's 
theorem [120, Dashen (1971)]. Thus, from Eq. (Z8) we 
obtain the VEVs of n’ and 7° for a small 6 


(n') 0 pv 

P eoa t Eg 

wla ia. (48) 
f ma. 


The VEVs of 7’ and 7° are vanishing if 0 = 0 or any quark 
mass is zero. In addition, we can estimate the 7’ proper- 
ties from the interaction (v? / K?) cos(2n'/ fy) where fy 
is the 7’ decay constant and K has a mass dimension. 
It comes from the yellowed diagram of Fig. Compar- 
ing 1? — 2y and 7’ — 2y decay widths, 7.74 eV and 
4.3 keV, respectively p, Amsler et. al. (Particle Data 
Group, 2008)], we obtain f? = (4/3)(m}, /m;)(T (x? > 


2y/T (y —^ 23))f2, or fy ~ 86 MeV. Fitting to the r/ 


T? 


mass, we obtain K = (v°/f?,m/2)1/> = 240 MeV. 


The 0 = 0 vacuum with axion 


We have shown above that the Lagrangian (38) chooses 
0 = 0 in CP conserving theories if 0, = 0 and 0, = 0, 
which is determined by QCD dynamics. However, if 
CP is broken, the vacuum value of 0 is shifted from 
the 0 — 0 value, by the presence of any linear term of 
04,0, or/and 0. The meson potential is invariant un- 
der CP with CP(x) = CP(n’) = CP(a) = —1. Such 
linear terms are generated by considering CP violating 
phases and chirality flipping (LR) insertions. So, lin- 
ear terms of a are generated by combining the 't Hooft 
determinental interaction and CP violating weak inter- 
actions. Linear terms of 7 and 7’ can also be generated 


by considering the weak interactions alone without the 
determinental interaction, but the conditions of the fla- 
vor singlet, chirality flipping (LR) and CP violating 
effects do not occur at one loop level. In the SM with 
the Kobayashi-Maskawa CP violation, Ref. [i3 Ellis, 
Gaillard (1979)] shows that a finite correction occurs at 
the fourth order, O(o?), leading to a very small NEDM, 
but infinite corrections occur from O(a"). These can 
give rise to a linear term of m. In the SM, the pioneering 
calculation with axion has been performed in chiral per- 
turbation theory to obtain 0 < 10717 fed [68 Georgi, 
Kaplan, Randall (1986), Georgi, Randall (1986)]. The es- 
timated 0 however is far below the current experimental 
limit of 1071. 


C. Axion couplings 


The axion interactions are given in Eq. (I9) which 
are depicted in Fig. [I0] where we have not drawn aww 
and aZ Z diagrams which are orthogonal to the ayy. The 
diagrams of Fig. [Olare complete for the low energy axion 
phenomenology, where the suppression factor, 1/Fa, by 
the axion decay constant is explicitly shown. 


FIG. 10: The Feynman diagrams of axion couplings. G and 
y are gluon and photon, respectively. c3 and cay y couplings 
are anomalous. 


1. Axion hadron coupling 


When we discuss axion—hadron interactions, which are 
the relevant ones for low energy laboratory experiments 
and physics at the core of supernovae, we must integrate 
out gluon fields. Technically, it is achieved by using the 
reparametrization invariance to remove the c30GG cou- 
pling. If we keep the c3 coupling, we must consider the 
axion-gluon-gluon interactions also, which is hard to be 
treated accurately with its face value but must be the 
same as the one in the c3 = 0 basis. In this way, the 
quark interactions are changed from the original value as 


= 1 
ĉi > & = C1 t 568 

Co — C9 = C2 + C3 (49) 
C3 — C3 = C3 — C3 = 0. 
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In the barred notation, there exist only ¢, and c3. 


We will discuss one family without separating ci,» 
into es first for an illustration, and then we will dis- 


cuss the cases with a and write down formulae for 
three families. Let us define the initial parameters c4, C2 
and c3 together with the definition of the vacuum an- 
gle 09 = @qcp. In principle, the initial vacuum angle 
can be a free parameter. Here, the vacuum angle Oacp 
is defined such that cı = 0. Picking up the axion de- 
pendent chiral rotation charge defined below the chiral 
symmetry breaking scale Eq. (4%), the chiral quarks 
in the chiral perturbation theory are transforming as 
qn > exp(iQ 40)qr, qr — exp(—iQ 40)qg where 


1 M^ . 1 1 


Qa = Saye "me! ma^ 


The derivative interactions of axion is obtained in this 
way [16 Kaplan (1985), Georgi, Kaplan, Randall 
(1986)]. 


For the KSVZ axion, we have cı = cp = 0 and c3 = 1, 
and the coefficient of the gluon anomaly term is (a/ F5) 4- 
0ocp. Hence, redefining the axion as a + F;0oQcp, we 
obtain? 


ES 
N 
I 
[e] 
— 


KSVZ axion (c1 = 0 E 
C c3 = i (51) 


Here, c9 must be split according to the flavor singlet con- 
dition to c! + cd, 0 ox UJ. 


For the DFSZ and PQWW axions, c1 = 0, c2 Æ 0 and 
c3 = 0. If a non-vanishing 0ocp is introduced here, we 
have, using the reparametrization invariance (ZI), c, = 
—c2/2,ch = 0, and ch = c2. Then, the coefficient of 
the gluon anomaly term is c»(a/ fs) + 0Qcp, and hence 
redefining axion as a+ (fs/c2)8Gcp and going back to 
the c3 — 0 basis, we obtain for one family 


DFSZ and PQWW axions : 
& = į(—c2 + &), (52) 
C) Z0, c4 — OQ. 


Again, c must be split according to the flavor singlet 
condition to c! + cd according to the anomaly matching 
condition, Eq. (12). 


Integrating out the heavy c field and heavy quark 
fields, the massless (at this level) degree a = F,0 which 
appears from the phase of the singlet field o = (lo) + 
Ze appears in the effective low energy Lagrangian. 


If there are multiple SM singlets S; carrying PQ charges 


3 The sign convention is stated below. 


and VEVs, then the axion component is proportional to 
1 22/2 
a— y 2 Tia (8), Va = oy lV) (53) 


where a(S;) is the phase field of $;. The PQ charges are 
defined such that the smallest nonzero absolute value(s) 
of the PQ charges is 1 so that every scalar field returns 
to its original value after 27 shift of its phase. Let us 
discuss axion couplings after integrating out the heavy 
fields. 


In the KSVZ model c3 is calculated by the 
triangle diagram of heavy quarks for the global 
anomaly. The domain wall number Npw is Npw = 
TrT (Qz)U(Qz), with Fa = V,/Npw where T(Qr) (de- 
fined as Qr, — ei (9290, under a > a+ F,0) is the PQ 
charge and /(Qr.) is the index of SU (3). representation. 
Every field is represented in terms of left-handed fields, 
and the PQ charges are defined such that the SM sin- 
glet o coupling to heavy quarks carries one unit of the 
PQ charge. If the light quarks also carry the PQ charge, 
then (22) gives Npw, which belongs to the generic very 
light axion model discussed below. The anomaly calcu- 
lation gives the one loop coupling (Npwa/Va {GG}, but 
since the vacuum angle 0 or axion is given by the coef- 
ficient of {GG}, Fa is defined by dividing Va of (53) by 
Npw and hence c3 = +1 where the sign coincides with 
that of TrU'(Qr)I(Qr). As a convention, let us choose it 
to be cz = +1, which is choosing the effective PQ charges 
of heavy quarks to be positive. Transferring c3 to co, we 
split c3 = c? + cd using the PQ charges of 0), 


KSVZ axion: 
at -ie (54) 
Su 1 Ad 


In the DFSZ model, c! and c% are calculated by trans- 
ferring the phase of o to H, and Hg by the PQ sym- 
metry such that (H9) = /2vje'T«"/V» and (H9) = 
vv, eiT«e/Vs if H*H%o? defines the PQ charge of c 
in terms of PQ charges T’, and Tg of H, and Ha. 
Here, a = V,@ is not the mass eigenstate and instead 
of V> the mass eigenstate à uses the decay constant 
Fa = [Tu +Ta)?V2 + T2v2 + 122] ~ (T, +Ta)Ve 
for V, >> Vu, va, and the axion component à = [(D, + 
Ta)Voa + 'avgwa(H,) + Tavewa(Ha)|/Fa ~ a, and 
Fa = V; /Npw. So, in the DFSZ model they are given by 
[58, Carena, Peccei (1989)]: c = EE. e = Bap. and 

EW EW 


d MT : 
ci^ = c3 = 0. By the reparametrization invariance, Eq. 


CI). we can use c" = —c2/2, e? = —cd/2, c = e$ = 0, 
and ch = c+ c = 1. Removing c5 according to the 
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flavor singlet condition, we obtain for one family 


DFSZ axion for one family : 


zu — _ val lzu ad .. _ val lad 

C —-—X£—-c30, 0 Ce + 30 (55) 
Lu. 1 asd _ Z 
C2 —1xz7» 02 — TTT 


where v, = |(V2H9),v, = | /2H9)|, vgw = (v2 + 
gais z 12 
v2)!/2. The PQ charges ct = pel and cd = a of Hu 
EW EW 
and H4 are obtained by considering the orthogonal com- 
ponent to the longitudinal mode of Z boson. Remember 
that the signs of p are chosen from the convention that 
the PQ charges of Hu, a are positive. This result is for one 
family. 


If we have Ng families, we can calculate the couplings 
just below the electroweak scale where all quarks ob- 
tain masses. Then, we obtain for three families, c% = 


2 c. 2 
c$2dzc- ME and @ = e = d = UE . Us 
: Ew , . . . EW 

ing the reparametrization invariance, we can calculate 
D ou Joos "E m — li 
c1, C3 and c$, just above 1 GeV: c3 = 0, c7 = —5c$ and 


cs = Nœ; f) = Ng. Then, we integrate out heavy 
quarks c,b, and t to obtain the effective couplings just 
above 1 GeV, which does not introduce any new c» terms. 
Now, there are three light quarks u,d, and s for which 
we use the reparametrization invariance to remove the 
cs term such that the isosinglet condition is satisfied, 
0" J^ is anomaly free where Jọ = Ja? — AuW 5U — 
aadyuysd — 557758 and oH JS = N,{GG}. Thus, 
Qu + aa +a&s = N; is satisfied and the SU(3)favor singlet 
condition of 0“ J/; determines 


Qy = ——, Aq = —, As = : (56) 
Mu Ma Ms 
with 
m Nym,mqms " Ng(mu + ma)Z 
MuMat+tmMums+mams = (1+ Z)? 


Therefore, removing c4 by the reparametrization invari- 
ance, we obtain 


DFSZ axion for N, families : 


EU 1 E Z 

C9 = 1-Z g» C2 LEZ g^ (57) 
E 1 =u v -d 1 -d Vu 

ej. = 22 T €i = 22 vL (58) 


If heavy quarks and also H,,4 carry PQ charges, we 
must consider all these. If one SM singlet ø houses the 


axion, then we obtain 


General very light axion : 
1 


= Gl Ny) 59) 
a= DL + N3) 60) 
a = xrcz us Ng) F I = 61) 
ad = ag" + Ng) F a Ôn, 62) 
where the PQ charges (+ or —) of H, and Ha deter- 


mine the sign (— or +) in front of DFSZ component and 
on = 1 or 0 if the corresponding Higgs doublets carry the 
PQ charges or not. For the MI axion from superstring 
which is a hadronic axion, in principle there can exist an 
additional contribution to c; as will be pointed out in 


If there is no heavy degrees carrying the PQ charges 
above the electroweak scale, then cg in the so-called 


LLL 
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PQWW model is given by the PQ charges of H, and 
Hia, 
PQWW axion: Same as Eqs. (27058). (63) 
All models have e, and c3. For the original cg term, 
different models give different values, for example some 
variant axion models |25, Krauss, Wilczek 
(1986), Bardeen, Peccei, Yanagida (1987), Kim, Lee 
(1989), Hindmarsh (1997)] have different cos from those 
of the PQWW axion. For astrophysical application, we 
must keep both c4 and cs. The ¢ and c» terms respec- 
tively give the axial vector and pseudoscalar couplings. 
The axion operator in the flavor SU(3) space can be writ- 
ten as 


=u Ad E Ad 
Cro + C13 Cro + C13 
/3 6 


where F4 and F&/4/3 are the third component of the 
isospin and the hypercharge operators, respectively, and 
1 is the identity operator. 


(Ca — C2) Fs + 1 (64) 


Then, the derivative couplings with nucleons and mesons below the chiral symmetry breaking are defined as 


e O"a _ = r Tt " T 

AV — p CappPYu'Y5P + Conn psn + iConNN SEP al = qm (65) 
e Ll OO a pA 0,—5 4. 944-8 49 
Jr pm Cannan y; 7 (n nOn +r T Onn 2a*n Out), (66) 
where 
cu — 9cd Au zd cd — 9g Au ad 

Cop o FTD s : Cam — GF +t D + Tg (67) 
CanNN = a — E , oe = ae — ci) . (68) 


V2 


3 


Here the axial vector coupling parameters of the nucleon octet are given by F = 0.47, D = 0.81, S ~ 0.13 + 0.2 H 
Amsler et al. (Particle Data Group, 2008)]. For example, for the hadronic axion couplings we obtain the results given 


in [ril 215, Kaplan (1985), Chang, Choi (1994)], 
1 [25 1 
F+- D +S 
tara '6 
bx 
Can ne 
NN 2/2(1 + Z) 


Capp = 2(1+Z) 


For the DFSZ axion, there exist additional contributions 
from the extra terms in Eqs. EAB). 

Similar expressions might be attempted for the pseu- 
doscalar couplings in terms of a and the pseudoscalar 
coefficients F’, D' and S'. But, for the axion current, 
corresponding to J“, there does not exist an anomaly as 
discussed in Eq. a and we do not write down the axion 
pseudoscalar couplings. The anomaly carried by axion 
above the chiral symmetry breaking scale is left over to 


, 


Z Z—2 1 
Cann = > F + SS DY SS 
XI-Z) 60-2) 6 
1-Z 
Canna = Ria, aA? 69 
3(1+Z) (69) 
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7 below the chiral symmetry breaking scale and hence 
these pseudoscalar couplings are for the 7/ meson. The 
axial vector current of n’ to the nucleon octet N = q@q@q 
is, 


JT = fn Out! "P IAN yus ToN (70) 


where To is properly normalized TrTË = i or To = 


1/,/2Ny, and gy is determined by strong interaction 


dynamics. The original global symmetry breaking term 
(QU) is transferred to N;,Nge/o1"/fs which is actually 
the nucleon mass term, 

AL = —my Ñ, Nge" / f, + h.c. (71) 
For example, the SU(6) wave function of spin-up neutron 
is 

In!) = sug 4d! ud! — 2d! uid! — 2d! ul d! — 2u'd!d' 
-HAuld! d! — 2u! d! qd! — 2d! du! — 2d! d! u! + Ad! d! ul) 
(72) 

where the quarks are now interpreted as constituent 
quarks below the chiral symmetry breaking. At low en- 
ergy, the symmetry is the only relevant one for consid- 
eration. The octet charge o/ is determined by strong 
interaction dynamics. The ducaplet has a different U(1) 
charge o4. Two anomaly matching conditions, PQ-B-B 
and PQ-Qem-Qem, may be used but do not give a useful 
information because of many form factors. So, simply 
interpreting the PQ charges of the current quarks being 
transferred to the BORstHeHt quarks in the octet with a 
multiplcation factor g,, we obtain the PQ charge of neu- 
tron as the PQ charge of one constituent quark. Thus, 
NLNe has the phase o/ = 295 /2N,/N,. Guessing that 
gx is similar to the octet form factor gą œ 0.75 
Georgi (1984), p.100], a, is estimated as 1.22. 


2. Amion-photon-photon coupling 


As we calculated the c3 coupling for the KSVZ axion, 
we can calculate the axion-photon-photon coupling, by 
simply substituting the gluon lines by photon lines and 
the quark triangles by charged fermion triangles. Since 
we are interested in low temperature experiments, we 
consider the energy scale below s electron mass. There- 
fore, considering V, = Npw Fa, c0... calculated from the 


ayy 
PQ dara of charged fermions becomes 


TT (Q7)Q? 

0 em T3 
Cayy — Now ( ) 
Below the QCD chiral symmetry breaking scale, we chiral 
transform light quarks to obtain 


2 
up FEX 


e 
a 4 
Cayy 3272 Fp em (7 ) 


Day, = 
where 


Cayy = Caa — CXSB (75) 


where the chiral symmetry breaking effect is, including 
the strange quark mass effect, 


2(4 + 1.052) 


2———————~ = {1.762, 2.2 
1 + 1.057 barca 


CxSB = (76) 


19 


for the 20% allowance from the tree level chiral pertur- 
bation theory estimation Kaplan, Manohar (1986)]. 
For illustration, let us take cygp ~ 1.98 for Z ~ 0.5 bod 
Manohar, Sachrajda (2008)]. 

In the KSVZ model, c?,., is determined by the PQ 
charge carrying heavy fermions. If there is only one neu- 
tral quark for this, then c$,, would be zero. If there 
is only one PQ charge carrying BEN quark with the 
electromagnetic charge Qem, then Can = = Qa. But, in 
realistic models from a fundamental theory it is more 
likely that there exist many PQ charge carrying quarks, 
and the coupling given for one PQ charge carrying heavy 
quark is presented just an illustration. 

In the DFSZ model, we consider only light quarks and 
leptons. The PQ charges of H,, and Hq determine the 
PQ charges of u and d quarks. For the PQ charge of e, 
we have two possibilities: Hq gives mass to e and the PQ 
charge of e is the same as that of d, or Hy, gives mass to 
e and the PQ charge of e is the opposite to that of u: 


Qv2 2 2v2 1 
0 d 2 2 
= — — S = = P 3 —1 
Ay = ae Gao at ( 5) 3 +( " 
8 
= E electron mass by Ha (77) 
202 / 2 20 1 
0 d 2 2 u 2 
mE 243.3 1 =)" 3 
me vew G CU ) By | P 
2 
= E electron mass by Hi (78) 


where the PQ charges of H., 4 were chosen to be positive 
before. So, in applying Eq. (4), we must choose the 
PQ charges of light quarks to be positive and hence the 
signs of must be reversed. For the PQWW axion, 
the coupling is the same as those of with positive 
signs. 

For a general light axion, the axion-photon-photon 
coupling depends on the ultraviolet completion of the 
theory. If the axion mass is lighter than 2me, its lifetime 
is 


2953 F2? 3.65 x 1074 (eV y? 
T(a — 2y) = aan PER mei nne 
ayy em a ayy a 
0.8 x 10’ty / eV A? 
gg MM e (79) 
c2 m 
ayy a 


where we used Z ~ 0.5 and the age of universe ty ~ 
135% 107 s. Boris = the axion with 24 eV 
mass has the lifetime ty "E pzd. Moroi, Murayama 
(1998), Hannestad, Mirizzi, Raffelt, Wong (2008)]. 


3. Axion lepton couplings 


The tree level axion lepton (l) coupling arises in the 
DFSZ and PQWW axions where the lepton mass term 
through the PQ charges of H4 or H, defines the c; of Fig. 
The removal of c3 term does not change the coupling 


ci, and hence we obtain the following tree level couplings 
of axion and lepton: 


DFSZ axion : 
2 
Nos F liysla, lepton mass by Ha (80) 
2 
mm liysla, lepton mass by Hj, (81) 


where the PQ charges of H,,4, are chosen to be posi- 
tive. For the PQWW axion, just Fa is replaced by vgw. 
For the KSVZ axion, the axion lepton coupling occurs 
at higher order which is negligible in astrophysical appli- 
cations. For the generic very light axion, the couplings 
given in Eqs. are applicable. 

Even though, the tree level coupling of the axion with 
electron is absent in the KSVZ model, the axion-electron 
coupling is present at one loop through the c,,~, coupling 

, Srednicki (1985)] 


Fa sar A =| (82) 


ev) |" "m, 3142. me 


23x10 7 (75) E In 
where A is the chiral symmetry breaking scale and N p 
must be multiplied in models with Now 7 1. On the 
other hand, the DFSZ axion coupling to electron is 


3 m 
—11 a 
1.4 x 107 Xa (=) (=) (83) 


where Ng = Npw/2 is the number of families and Xq = 
sin? B = v2/vz for the case of Eq. (BO. 


D. Old laboratory bounds on Fa 


With the axion couplings discussed in Subsec. IQ 
one can estimate the axion production rates in vari- 
ous experiments. The null experimental results give the 
bounds on the relevant axion couplings. These are ex- 
tensively discussed in earlier reviews isi , Kim 
(1987), Cheng (1988), Peccei (1989)]. These old labora- 
tory bounds, immediately studied after the proposal of 
the PQWW axion, basically rule out the PQWW axion, 
ie. give the axion decay constant F, greater than O(10 
TeV), 


F, = 104 GeV : old laboratorybound. (84) 


IV. AXIONS FROM OUTER SPACE 


From Eq. (9), we note that the axion lifetime is longer 
than ty for m, ~ 24 eV, and this kind of axion is impor- 
tant in cosmology. For Ma < 23 keV with Cayy = 1, the 


axion lifetime is longer than 10 min, allowing solar gen- 
erated axions below this mass have time to reach Earth. 
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These examples illustrated the importance of studying 
low mass axion effects in astrophysics and cosmology. 

The window for F, obtained from the astrophysical 
and cosmological constraints is given by 


0.5 x 10? GeV € Fa $2.5 x 10 GeV (85) 


where the upper bound is understood with the initial 
misalignment angle of order 1. 


A. Axions from stars 


In this Subsec. we present the key arguments leading 
to the axion constraints from astrophysical sources. Ax- 
ions have very small masses and therefore can be emitted 
without important threshold effects from stars, in anal- 
ogy to neutrinos. The method to constrain axion models 
is basically the overall energy loss rate, whether using 
the individual stars (e.g. Sun, SN1987A) or the statisti- 
cal properties of stellar populations (e.g. the stars in a 
globular cluster being a test population) 4d Bod, Kolb, 
Turner (1990), Raffelt (1996)]. 

We may use the axion couplings to y,p,n, and e to 
study the core evolution of a star. The simple bounds are 
obtained just by comparing the energy loss rates by the 
axion emission and by the neutrino emission. Studying 
the evolutionary history of a star by the axion emission 
may give a stronger bound than the one obtained from 
the energy loss rate but may not be so reliable compared 
to the one obtained from the energy loss rate. Since there 
are good reviews on axion astrophysics p, 04), Bod B51) 
Turner (1990), Raffelt (1990, 2008), Amsler et al. (Parti- 
cle Data Group, 2008)], here we briefly comment on axion 
physics in stars (Sun, low mass red giants, supernovae) 
to cite the reliable F, bound. 

With the axion emission, the Sun consumes more fuel 
and needs an increased core temperature. From the Pri- 
makoff process y + Ze — a+ Ze in the hadronic axion 
models, Ref. , Schlattl, Weiss, Raffelt (1999)] gives 
the axion emission rate La ~ 3.7 x 10^? Lo with a 20% 
increase of the °B flux with the increased core temper- 
ature. The 5B neutrino flux gives the best bound on 
the solar axion emission rate. The measured 8B neu- 
trino flux 4.94 x 105cm-?s-! H, ld, SNO Collaboration 
(2005a, 2005b)| is consistent with the axion emission if 
La < 0.04Lc RI, Bahcall, Serenelli, Basu (2005)]. This 
translates to the Fa bound of Fa/Cayy > 2.6 x 106 GeV 
for La € 0.04Lo B23, Schlattl, Weiss, Raffelt (1999)]. 

For the axion-electron coupling such as in the DFSZ 
axion models, the axion emission from the globular clus- 
ters gives a useful F, bound Bog, Raffelt, Dearborn 
(1987)]. Stars in a globular cluster are assumed to have 
the identical Y (helium fraction) and metallicity frac- 
tion. The helium core before ignition is degenerate and 
the bremsstrahlung emission is very effective, whereas 
the Primakoff emission is suppressed by the large plasma 
frequency and the helium ignition does not give a use- 


ful Fa bound for the KSVZ axion. However, after he- 
lium ignition the core degeneracy is lifted, the Primakoff 
effect becomes important, and the consumption of he- 
lium fuel is accelerated by the axion energy loss, short- 
ening the helium-burning lifetimes. Horizontal branch 
stars in several globular clusters confirm the expected 
helium-burning lifetimes, which agrees with the stan- 
dard prediction and the axion losses should not exceed 
£a < 10 erg g~'s~! in the cores of horizontal branch stars 
[4 Bod Raffelt (1990), Catelan, de Freista Pacheco, 
Horvath (1996)], which leads to F,/c44., > 2 x 10"GeV, 
a factor of 10 improvement over the solar bound. Note 
that this globular cluster bound is for models with an 
appreciable axion-electron coupling. 


In the study of the axion emission in the small mass red 
giants, the processes y+Z — a+Z,e+Z — a+e+Z, and 
y+e— a+e were considered. The early studies were the 
simple comparison of the axion emission and the neutrino 
emission [16d 25d, Fukugita, Watamura, Yoshimura 
(1982a,1982b), Krauss, Moody, Wilczek (1984)]. In the 
study [124 Dearborn, Schramm, Steigman], it is summa- 
rized as Fa > 2.1 x 10’cayy GeV if the Primakoff process 
y-Z a4 Z dominates and F, > 3.7 x 10? sin? 8 
GeV if the Compton process dominates (which is for the 
DFSZ axion, viz. Eq. (0). The Primakoff process is 
present in any axion model, and hence the Primakoff pro- 
cess bound is almost model-independent except in the 
region Ma > 200 keV where a is too heavy to be pro- 
duced in the core of a star. But this threshold effect 
is irrelevant since the PQWW axion region is excluded 
already. Note however that there is no confirmed obser- 
vation of neutrinos from the small mass red giants unlike 
from the Sun and SN1987A, and the possibility of dom- 
inant axion emission from red giants is not excluded by 
observation 08, Raffelt (2008)]. For the DFSZ axion, 
the region Ma > 107? is excluded due to the large axion- 
electron coupling. For the hadronic axion, Ref. 
Raffelt, Dearborn] argue that the the axion mass greater 
than about 2eV/[(E/N — 1.95)/0.72] would reduce the 
helium-burning time scale and is not allowed. 


For the supernovae explosion, the core temperature 
can go much higher than the temperature in the igni- 
tion phase of Helium in the small mass red giant cores. 
For supernovae, therefore, nuclear reactions are more im- 
portant and the PF, bound can be very strong. So we 
use the axion couplings to nucleons discussed in Sub- 
sec. [IITC I]to study the core evolution of supernovae. In 
the beginning, the bounds on the axion decay constant 
were simply obtained by comparing the nuclear burn- 
ing rates of producing axions and neutrinos l 
Iwamoto (1984), Pantziris, Kang (1986)]. The discov- 
ery of SN1987A was important in that it propelled a 
great deal of interest anew in the calculation of axion pro- 
duction rate [193 Bid Bad Raffelt, Seckel (1988), 
Turner (1988), Mayle, Ellis, Olive, Schramm, Steigman 
(1988), Hatsuda, Yoshimura (1988)] In principle, the 
same kind of bound on F, could be obtained from the 
earlier supernovae studies. The studies after the dis- 
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covery of SN1987A were performed with the derivative 
coupling and quartic terms of Subsec. [ILC I] and ob- 
tained a bound Fọ = 10? GeV. But as pointed out in 
[58 bd. 213 B59, Choi, Kang, Kim (1989), Kang, Pantzi- 
sis (1991), Turner, Kang, Steigman (1989), Carena, Pec- 
cei (1989)] and Subsec. [ILC T] a proper treatment of 
nucleon states must be taken into account. For the axion 
emission from supernovae, one must constrain the energy 
output to e, < 1 x 10?erg g-!s-! [304, Raffelt. (1990)]. 
'The axion emission rate calculation of Ref. Raffelt 
(2008)] is 


€; = 3.0 x 10?" [erg gs 1] Ch Fey Ta somey F (86) 


where Facey = Fu/GeV,Ta,30mMev = T/(30 MeV), and 
F = O(1). In the supernovae explosion the axion emis- 
sion can be comparable to neutrino emission. Such rem- 
nant axions from all the past supernovae explosions may 
be around us but will be difficult to be detected because 
of the small 1/F, 08, Raffelt (2008)]. For the smaller F, 
region from supernovae explosion, axions can be trapped 
if the axion-nucleon interaction is strong enough. For 
the hadronic axion, it gives the bound on m, > 1 eV 

5d, Turner (1990), Raffelt (1990), and we have 
the hadronic axion window in the eV range. 


For the KSVZ axion and the MI superstring axion, 
Cj terms are present. For example we can simply take 
a= i and ¢ = 4, corresponding to Z = 0.5, and hence 
obtain Capp = $F + 7S c 0.17 for the KSVZ axion. 
Using Capp as Cn in Eq. (EQ), we obtain an Fa bound 
from supernovae, 


F, > 0.5 x 10° GeV. (87) 


The white dwarfs in the final evolutionary stage of 
low mass stars (M < 10 +2 Mo), with the theoreti- 
cal model implemented in the DFSZ model, may give a 
stronger bound on F, 03, Raffelt (1986)] for some re- 
gion of the DFSZ parameter tan 3 = (H,)/(H4). The re- 
cent study of the bremsstrahlung process gives the bound 
F, > 0.6 x 10!° sin? 6 GeV, and even fits the cooling di- 
agram nicely with F, ~ 1.2 x 10!° x sin? 8 GeV for Ha 
giving mass to electron pod Isern, García-Berro, Torres, 
Catalán (2008)]. Note that tan is known to be large 
(2 30) in SUSY GUT models, and the white dwarfs may 
give the strongest F; bound for some DFSZ axion mod- 
els. 


The axion-nucleon coupling gets enhanced in strong 
magnetic field. Magnetic fields as strong as B > 10!5 
Gauss in neutron stars have been assumed in the scalar 
virial theorem B74, Woltjer (1964). With B > 1029 
Gauss at the surface, the axion emission rate from neu- 
tron stars or white dwarfs will get enhanced by O(1) 
compared to the B = 0 case [19], Hong (1998)]. 


In summary, axions once produced in the hot plasma 
of a star most probably escape the core, taking out en- 
ergy. This contributes to the energy loss mechanism of 
a star and is used to constrain axion models. From the 


nucleon-nucleon-a coupling, SN1987A gives the strongest 
astrophysical bound on the axion decay constant, F, > 


0.5 x 10? GeV Bod, Bod B51, Turner (1990), Raffelt 
(1990), Raffelt (2008)]. 


B. Axions in the universe 


Axions with Mma Z 24 eV have a lifetime shorter 
than the age of the universe. In this case, axion de- 
cay can lead to photons that can be tested against the 
observed electromagnetic background of the universe, as 
in some spontaneously broken flavor symmetric models, 
Vj — vja — viyy Berezhiani, Khlopov, Khomeriki 
(1990)]. However, in this case the needed decay constant, 
109 GeV, is outside the current bound on Fa. 

The axion for ma < 24 eV has a longer lifetime than 
the age of the universe and can affect its evolution. The 
heavy thermal axions around the eV mass range of Fig. 
Xb) become the hot DM in the universe. For the 3- 
8 eV mass range, they accumulate in galaxy clusters 
where their slow decay produces a sharp line that, in 
principle, can be observed by telescope searches as sug- 
gested in mi Bershady, Ressell, Turner (1991)]. In this 
case, the neutrino and axion hot DM must be consid- 
ered together, which now constrains the axion mass to 
ma < 1.02 eV [189, [19d, Hannestad, Mirizzi, Raffelt, 
Wong (2007, 2008)], almost closing the hadronic axion 
window of 1-20 eV of Fig. 

But more attention is paid to axions behaving as the 
CDM candidate. The axion potential is almost flat as de- 
picted in Fig. [TT] Therefore, a chosen vacuum stays there 
for a long time, and starts to oscillate when the Hubble 
time H~! is comparable to the oscillation period (the 
inverse axion mass), 3H ~ ma. This occurs when the 
temperature of the universe is about 1 GeV ll Bod, 
Preskill, Wise, Wilczek (1983), Abbott, Sikivie (1983), 
Dine, Fischler (1983)]. There exists the domain wall 
problem in the standard Big Bang cosmology Sikivie 
(1982)]. The axion strings and the domain wall problem 
have been elegantly summarized in [33d, Sikivie (2008)]. 
'The axion cosmology is correlated to the reheating tem- 
perature Truy in the inflationary models, where one must 
deal with both the inflaton and the axion. The density 
perturbations produced by the perturbations of the infla- 
ton field is adiabatic, ÔPmatter/ Pmatter = (3/4) dprad/Praa- 
On the other hand, the perturbations produced by the 
fluctuations of the axion field is isocurvature. If the re- 
heating temperature Try is above the axion scale Fa, the 
limit on the isocurvature less than 3096 from the large 
scale structure data can be used Ba Beltrán, García- 
Bellido, Lesgourgues (2007)]. This will be commented 
more in Subsec. [IV C]on the anthropic argument. 

In supersymmetric models, the reheating temperature 
is constrained to Try < 10? GeV or 10’ GeV (if the 
gluino is lighter than the gravitino) from nucleosynthesis 
requirements in models with a heavy gravitino Eid 
Ellis, Kim, Nanopoulos (1984), Kawasaki, Kohri, Moroi 
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(2005)]. So, with SUSY the domain wall is not so prob- 
lematic. In this case, the problem of string radiated ax- 
ions requiring axion mass m, > 10^? eV id Ei 
Davis (1985), Harari, Sikivie (1987), Dabholkar, Quash- 
nock (1990)] is no longer problematic. 


FIG. 11: The almost flat axion potential. The misalignment 
angle is expected to be of order 1 but can also be very small 
as shown by the thick blue arrow. 


Axions are created at T ~ Fa, but the axion vacuum 
(a) does not begin to roll until the Hubble parameter 
reaches the axion mass 3H = ma, which occurs at T œ 1 
GeV. From then on, the classical field (a) starts to oscil- 
late. For a small misalignment angle, the energy density 
behaves like that in the harmonic oscillator m2 F? which 
is proportional to the axion mass times the number den- 
sity. Thus, its behavior is like that of CDM, which is 
the reason that the axion DM is CDM even though its 
mass is very small and its interaction strength is much 
weaker than “weak”. Even for a large misalignment an- 
gle, an adiabatic invariant J exists and one can estimate 
the current axion energy density. The axion field evo- 
lution with the adiabatic change of the axion mass has 
been considered before [à re Chang, Hagmann, Sikivie 
(1999, 1998)]. 

The temperature dependent axion mass [ied Gross, 
Pisarski, Yaffe (1981)] enters in the determination of cos- 
mic temperature T; where 3H(Ti) ~ m4(Ti). The new 
estimate of T, for F, « 1019 GeV is a bit below 1 GeV, 
Ty ~ 0.92 GeV [1d, Bae, Huh, Kim (2008)]. QCD has two 
phases: the quark-gluon phase and the chiral symmetry 
breaking hadronic phase. Near the critical temperature 
Te, these two phases are separated above and below Te. 
'The critical temperature is estimated as 1485172757) 
MeV for three (two) light quark flavors |52, Braun, Gies 
(2007)]. So, cosmology near T; needs an information on 
the temperature dependent axion mass. This region is 
in the boundary of weak and strong coupling regimes 
and it is very difficult to estimate the axion mass ac- 
curately. Early attempts in this direction are given in 
Bod Bad Bad, Steinhardt, Turner (1983), Seckel, Turner 
(1985), Turner (1986)]. 

The 't Hooft determinental interaction is given 
schematically in Fig. In the quark-gluon phase, we 
have the first diagram in the box, which is parametrized 
as —K~°(mymams/p*) cos((ca4-c3)0] where p is the effec- 
tive instanton size in the instanton size integration. Ref. 
Gross, Pisarski, Yaffe (1981), Eq. (6.15)] expresses 
the result as 

n(p, 0yel- 3 GN No) 1240) [$0 - Nr] (88) 
where A = mpT,A(A) ~ —45ln(1 + X?/3) + a(1 + 
yA~2/3)-8 with a = 0.01289764 and y = 0.15858, and 


the pre-factor n(p,0) is the zero temperature density 


1 
n(p,0) = Mumam. Cn (£p) 


4 2 2N PE 
—8mn^ 1 


Here, the parameters are £ = 1.33876 and Cy = 0.097163 
for N = 3. For the QCD coupling constant, we use the 
three loop result ld Amsler et. al. (2008), QCD by 
Hinchliffe], 


& ( ) 92) Am | |. 251 In[In(u? / A 65)] 
cu) = 9S = gos | e OSA) 


x ( (m [ma2/2855)] - 3s 


ie - 3) e» 
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where fo = 11 — £N5, 8| = 51 — N}, and b2 = 2857 — 

S N HESENT. At T = Toev GeV (from 700 MeV to 1.3 

GeV), let us parametrize the instanton size integration of 
as 


V(0) = —C(T) cos(0), (91) 
where 0 = a/ F; and C(T) is 
C(T) = ox GeV*(Taey) ^. (92) 


We obtain Qinst = 2.858 x 1071? (9.184 x 10712, 7.185 x 


———————K— 


VZ 


Ppa(Ty = 2.73K) = ma(Ty)na(Ty) f1 (02) = Teo" ae Ti 3 


where fi(05) is the anharmonic correction and we used 
Z = m,/mq c 0.5, m, = 135.5MeV, fr = 93MeV and 
Jxs(present) = 3.91. y is the entropy increase ratio from 
extra particles beyond the SM. This becomes roughly 
1.449 x 19-1! & (He) F(0;, n) eV4, where 6; is 
the initial misalignment angle at 7j and 05 is the an- 
gle at somewhat lower temperature 75 where the adi- 
abatic invariant J is calculated. The total correction 
factor F(01,n) takes into account the anharmonic ef- 
fect and the initial overshoot of the misalignment an- 
gle, presented in |19, Bae, Huh, Kim (2008)]. For the 
critical density pe = 3.9784 x 10-1!(h/0.701)? (eV)* 
and Agcp = 380-r 60 MeV, the axion energy fraction, 
in terms of Fa only, is [1d, Bae, Huh, Kim (2008) 


Qa = 0.397ABC (a) e (95) 


h 


23 


10-3),n = 6.878 (6.789,6.967) for Agcp = 
380 (440, 320) MeV [Id, Bae, Huh, Kim (2008)]. Equat- 
ing 3H(T) and m(T) = J/C(T)/F2, we obtain the fol- 
lowing T; for Agcp = 380 MeV lid Bae, Huh, Kim 


(2008)], 
Fa aey VO 
Ti ev = 0.889 ( — ) l (93) 
For F, = 10!?GeV, we obtain T, c 0.89 GeV. This 


number is smaller than those given in 1980s because we 
used the smaller number for the product of current quark 
masses ™,mqm, based on the recent compilation of light 
quark masses ba Manohar, Sachrajda (2008)]. 


(1)No sudden change of ma(T): 


Since the potential varies much more slowly than the 
field itself, we can use the so-called ‘adiabatic invariant 
theorem’ that if the potential is adiabatically changed, 
the area in the phase space swept by the periodic mo- 
tion is unchanged per one axion oscillation , Landau, 
Lifshitz (1976)]. In this case, for a small misalignment 
angle the adiabatic invariant is p(t)/m(t) which can be 
interpreted as the conservation of the total axion number. 
For a large 61, the invariant is not the axion number den- 
sity, but the CDM energy density which can be related 
to the axion number density by a correction factor [19, 
Bae, Huh, Kim (2008)]. If we apply this until now, we 
obtain 


3- 1.669.5(Ty)T? Fa 05 fi (02) 


m 


where A = (mymams/3-6-103 MeV*)~°-?, B = 
(F,/1012 GeV)1184-0.0107 with z = (Agcp/380 MeV) — 
1, and C= (Aqcp/380 Mey -9-695, 


(2)Sudden change of ma(T): 


Let us try to calculate the misalignment angle be- 
low the critical temperature of chiral symmetry break- 
ing where a sudden phase change is experienced near the 
critical temperature Te. The QCD interaction for light 
quarks below 1 GeV can be written as 


L= (murur + madrdg + ms5LSR + h.c.) (96) 

—K-5(ürupgdrdgSrpsge i8 + h.c.) 
where K has the mass dimension arising from QCD in- 
stanton physics. The ’t Hooft determinental interaction 
‘t Hooft (1976)] written above is equivalent to the 
anomaly term and has the same chiral symmetry behav- 


ior. For T. S T S Fa, quark bilinears are not developing 
VEVs, and the relevant determinental interaction for the 
axion is the first diagram inside the box of Fig. [9] Now, 
the importance of the determination of T, is how much 
the misalignment angle 0, can be shrunk at Te. 
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FIG. 12: Phase transition near the critical temperature Te ~% 
150 MeV. 


In the hadronic phase below the critical temperature 
Te, the axion potential is schematically depicted in Fig. 
The value 01 (Te) is the boundary value at Te we use 
in the effective Lagrangian below Te. Below Te, the quark 
bilinears develop VEVs and we must consider the possi- 
bilities of {qr replaced with (grqm). The effective La- 
grangian from the determinental interaction is depicted 
in Fig. 

In the limit Fa >> f’, the mass eigenstates in one flavor 
QCD are 


, 1 =f a 
mass ~ f' Fe , Ümass ~ Eme (97) 
14-m/ K' 1 


Eq. (Z8) with v? = 0 has minima at 0 = 27n (n = 
integer). For v? Æ 0, minima are at 0y = 2mm (m = 
integer) and 0 = 2zn (n = integer). Therefore, the 0 
direction can be taken as the approximate axion direction 
even below Te. The minimum point in the direction of the 
axion is not changed when one goes from 6, 4 0 to 0y = 
0, i.e. above and below the critical temperature. [If the 
minimum of 0 were shifted by 7 from going from 0, = 0 
to 0, = m, the shrunk 01(T.) at Te is near 7, and we 
must start from O(1) misalignment angle at T,.] In most 
regions of the phase transition space, a time scale At is 
needed for the sound wave of quark bilinears to propagate 
to a large distance, which releases the latent heat to keep 
the constant temperature during the first order phase 
transition in Rz Mukhanov (2005), p.149]. Even if one 
considers supercooling toward a sudden phase transition, 
the parameter space for a sudden phase change is almost 
nill and the axion energy density presented in (Q5) is 
reliable [19, Bae, Huh, Kim (2008)]. 

In Fig. we present the exclusion plot for m, = 
2.55 MeV, m4 = 5.04 MeV and m, = 104 MeV bog, 
Manohar, Sachrajda (2008)| in the Fa versus 61/,/7 
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1.5} Over Closure 
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FIG. 13: The plot for Fa versus the misalignment angle 61 / /7 
as a function of Qa. The overclosure portion is from the preci- 
sion measurement requiring Qa < 0.23 . The green region 
is the excluded region by the condition Qa 2 0.23. The yellow 
band is the error bar region of Agcp and two red lines are 
the limits from the light quark mass bounds. 


space, including the anharmonic effect and the WMAP 
value [I5d, Dunkley et al. (WMAP Collaboration, 2008)] 
of CDM density combined with additional data bad Ko- 
matsu et al. (WMAP Collaboration, 2008)] Qpmh? ~ 
0.1143 + 0.0034. Note that E, of order 10!? GeV is 
not very unnatural, which results from the new smaller 
masses for u and d Manohar, Sachrajda (2008)]. 


If the axion is the CDM component of the universe, 
then they can be detected even though it may be very 
difficult. The feeble axion coupling can be compensated 
by the huge number of axions, since the number density 
is ~ F? and the cross section is ~ 1/F?. So, there is 
a hope to detect cosmic axions, which has been realized 
by Sikivie's cavity detector Sikivie (1983)]. But 
the Sikivie detector has the technical limitation for the 
interesting large and median regions of the F,, window. 
For example, the F, region F, > 10!? GeV advocated 
in anthropic arguments needs too large cavity size and 
the supergravity mediation preferred region PF, ~ 5 x 
101? GeV requires 1.6 mm order cavities. For technically 
preferred axion masses in the region 10^ eV, one needs 
a low temperature cavity with dimension O( 104 cm?) 
and a magnetic field strength of O(10 Tesla). The current 
status of cosmic axion search is shown in Fig. I 
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FIG. 14: The bounds on cosmic axion searches with some 
theoretical predictions. The coupling on the vertical axis is 
the coefficient of E- B. The future CARRACK and ADMX 
experiments are from Boj, 


C. Axion cosmology beyond the window 


If Fa > 10” GeV, an O(1) misalignment angle 0, 
is ruled out from the cosmic energy density argument. 
However, if 6; < 1, the axion energy density can be 
within the closure density. Rather than fine-tuning 04 
to order 10^? for a Planck scale F, Pi (1984)], 
the anthropic argument of Weinberg ; , Weinberg 
(1987), Linde (1988)], that life forms can evolve in a uni- 
verse with a sufficiently long lifetime, can be used for an 
allowable 01. 

The homogeneous axion field value (with a —^ —a sym- 
metry) right after inflation can take any value between 0 
and tF, or 01 = [0, 1] because the height of the axion po- 


D. Quintessential axion 


In light of SUSY breaking in supergravity, it is gen- 
erally believed that at least a hidden confining force is 
needed at an intermediate scale. This hidden sector and 
the observable sector couple extremely weakly in most 
phenomenological models. This scheme fits very well in 
the heterotic string framework and in heterotic M-theory. 
In cosmology, on the other hand, we have the important 
dark energy problem already for a decade bod, , Per- 
mutter et al. (1998), Riess et al. (1998)], which has led 
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tential is negligible compared to the total energy density 
right after inflation. So, in the axion context with only 
the misalignment production of axions, the CDM den- 
sity is chosen as a random number by the spontaneous 
symmetry breaking of the U(1)pq. Even in the multi 
component CDM models, including axion, the axion mis- 
alignment angle can play as the random number. This 
singles out axion physics, as stressed in Bad Tegmark, 
Aguirre, Rees, Wilczek (2006)], from all other anthropic 
arguments without axion in that selecting an axion vac- 
uum has an unavoidable random process that fixes the 
key cosmological parameter. This also distinguishes ax- 
ions from WIMPs, super-WIMPs etc. where the abun- 
dance is fixed by particle physics parameters and not by a 
primordial random process. So Qa may be at the required 
value by an appropriate initial misalignment angle in 
models with axions with F, > 10!? GeV. Tegmark et al. 
studied the landscape scenario for 31 dimensionless pa- 
rameters and some dimensionful parameters with which 
habitable planets are considered for the assumed nuclear 
physics parameters [30, Barr, Seckel (1992)]. For exam- 
ple, Fig. 12 of |845, Tegmark, Aguirre, Rees, Wilczek 
(2006)] presents the scalar fluctuation Q ~ óp/p vs the 
matter density per CMB photon £, in which the anthrop- 
ically chosen point is shown as the star. In models with 
axion, this point results from a random number after in- 
flation. If à WIMP is the sole candidate for CDM, one 
obtains just one number for óp/p from particle physics 
parameters, which may not fit to the observed point of 
that figure. So, we may need the CDM favored WIMP 
and in addition the axion with F, > 10!? GeV, with the 
axion CDM fraction Ra = Qa/Qcpm. But this large Fa 
anthropic region has a potential conflict with the WMAP 
5 year data, as presented in the Fa vs. E;(= the inflation 
energy scale) plane of Fig. 2 of [194 Herzburg, Tegmark, 
Wilczek (2008). For Ra = 1, for example, F, > 10!4 
GeV is inconsistent with the WMAP 5 year data on the 
upper bound on the isocurvature fluctuation a, « 0.072 
Komatsu et al. (2008)]. 


From the study of outer space axions, we present a 
cartoon for the F, bound in Fig. where the future 
CAST and ADMX experimental regions are also marked. 


to a lot of interest in quintessence models since the late 
1980s Wetterich (1988)]. The quintessence related 
to axion physics is called *quintessential axion" (QA) 
which was suggested in ba Kim, Nilles (2003)]. There 
were attempts to identify one of the MD axions as the 
quintessential axion [82, 232, Kim (2000), Choi (2000)]. 


To explain the dark energy in terms of a QA, one re- 
quires the VEV of QA not to roll down until recently. Of 
course, it is required for the current vacuum energy den- 
sity of the classical QA to be of order A* ~ (0.003 eV)4. 
These two conditions restrict the QA decay constant f, 
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FIG. 15: A cartoon for the Fa bounds. 


and the QA mass m4. We can parametrize the QA (9) 
potential as 


Vid =A UE, 6-7. (98) 
q 
For w = p/p < -1- ô, we require f, > 


/(2—5)/65 Mp|U'| where U' = dU/dé [Al Kim, 
Nilles (2003)]. Generically, one needs a Planckian scale 
quintessential axion decay constant fq. So, the QA mass 
is extremely small, X 10783? eV. As a result, there are two 
problems to be resolved to achieve the QA idea: a large 
decay constant and an extremely shallow QA potential. 


It has long been believed that the MI axion has rather 
a robust model independent prediction of its decay con- 
stant [8d, B43, Choi, Kim (1985), Svrcek, Witten (2006)]. 
Recently, however, it was shown that the MI axion may 
not be model independent since the decay constant may 
depend on the compactification scheme in warped inter- 
nal space, ds? = h2n,,dz"dz" + gmn(y)dy™dy” 
Dasgupta, Firouzjahi, Gwyn (2008)], 


2 2 

Be e RE 

B Mp 

where 8 depends on the warping in the compact space 
yEK, 


(99) 
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Thus, the MI axion with a small @ can be a QA if the 
QCD axion decay constant can be in the intermediate 
scale. This possibility may be realizable in some com- 
posite axion models as recently suggested in [43 Kim, 
Nilles (2009)]. 


B (100) 


V. AXION DETECTION EXPERIMENTS 


'There are currently a variety of experiments searching 
for axions, whether they are left over from the big bang 
or produced in stars or the laboratory. Though these 
experiments search for axions at a variety of mass and 
coupling scales they all rely on the Primakoff process for 
which the following coupling, Cayy is given in Eq. (Z3. 


£- Cary uc Fon Fen]; Cayy m Eayy — 1.98 (101) 


= 2 
where Gayy = TrQz4 | gs» Mz - 


A. Solar axion search 
1. Axion Helioscopes 


Axions produced in the nuclear core of the sun will 
free-stream out and can possibly be detected on Earth 
via an axion helioscope, first described in 1983 B34 B34 
Sikivie (1983, 1985)] and developed into a practical labo- 
ratory detector in 1988 b53 van Bibber, McIntyre, Mor- 
ris, Raffelt (1989)]. The technique relies on solar axions 
converting into low energy X-rays as they pass through 
a strong magnetic field. The flux of axions produced in 
the sun is expected to follow a thermal distribution with 
a mean energy of (E) = 4.2 keV. The integrated flux at 
Earth is expected to be ®a = 9253.67 x 10! cm~?s7! 
with gio = (Gem/27Fa)Cayyl0'° GeV Bd, Ziotas et al. 
(2005)]. The probability of a solar axion converting into a 
photon as it passes through a magnet with field strength 
B and length L is given as: 


Bi QemCayyBL ? pl d cos(qL) 
An Fa (qL)? 


Here c444, is defined as the coupling of the axion to two 
photons as given in Eq. (QJ), while q is the momentum 
difference between the axion and the photon, defined as 


(102) 


q = m2/2E where E is the photon energy. To maintain 
maximum conversion probability the axion and photon 
fields need to remain in phase over the length of the mag- 
net, thus requiring qL « 7 [35d van Bibber, McIntyre, 
Morris, Raffelt (1989)]. For low mass axions q — 0 lead- 
ing to a maximum conversion probability. More massive 
axions will begin to move out of phase with the photon 
waves though this can be compensated for by the additon 
of a buffer gas to the magnet volume, thus imparting an 
effective mass to the conversion photon van Bibber, 
McIntyre, Morris, Raffelt (1989)] and bringing the con- 
version probability back to the maximum. Various axion 
masses can be tuned to by varying the gas pressure. 

An initial axion helioscope was built at Brookhaven in 
1992 and used a 2.2 ton iron core dipole magnet oriented 
at Sun with a proportional chamber for X-ray detection 
254, Lazarus et al. (RBF Collaboration, 1992)]. It was 
followed by a 4 tesla superconducting helioscope, devel- 
oped by the University of Tokyo, which ran for 1 week 
with an evacuated bore in 1997 237) Moriyama et al. 
(1998), Ootani et al. (1999)] and for 1 month with a he- 
lium filled bore in 2000 202. Inoue et al. (2002)]. Though 
both managed to set limits over a wide mass range their 
sensitivities were still well above even the most optimistic 
KSVZ axion couplings. Recently though, the University 
of Tokyo group released data taken between December 
2007 and April 2008, which was able to set a limit of 
gay, < 5.6 — 13.4 x 10-1? GeV"! for the axion in the 
mass range 0.84 < Ma < 1.00 eV Inoue et. al. 
(Tokyo Axion Helioscope Collaboration, 2008)]. 

In order to push into proposed axion model space 
third generation axion helioscopes have been developed 
at CERN (the CERN Axion Solar Telescope (CAST)) 
and at the University of Tokyo. Utilizing a prototype 
Large Hadron Collider (LHC) magnet with L — 9.3 m 
and B — 9 tesla CAST began taking data in 2003. It 
utilizes a rail system to track Sun for 90 minutes a day 
at sunrise and sunset and its dual magnet bore allows 
it to employ up to four different X-ray detectors (one 
on each end of each magnet bore). Currently a time- 
projection chamber (TPC), a Micromegas (micromesh 
gaseous structure) detector and an X-ray reflective tele- 
scope with a charge coupled device (CCD) detector are all 
used to detect converted X-rays. Results from the com- 
bined 2003 and 2004 runs yield limits on axion-photon- 
photon couplings to Cayy/Fa < 7.6 x 10 9GeV ^! mi 
Andriamonje et al. (CAST Collaboration, 2007)]. The 
experiments second phase utilizing ^He and ?He buffer 
gases is currently underway with the latter gas allowing 
for axion searches in proposed model space up to mass ~ 
1 eV. 


2. Bragg Diffraction Scattering 


An alternative to axion helioscopes was proposed in 
1994, using crystal detectors to search for X-rays gener- 
ated by coherent axion-to-photon conversion which meet 
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the Bragg conditons bod, Paschos, Zioutas (1994)]. Var- 
ious dark matter WIMP search collaborations were able 
to look through their data sets and set limits on possible 
interactions from solar axions. These included germa- 
nium experiments such_as COSME p74, Morales et al. 
(2002)] and SOLAX [I], Avignone et aL. (1998)], the 
reactor germanium experiment TEXONO m Chang et 
al.], as well as the DAMA experiment mi , Bernabei 
et al. (2001, 2003)] which utilized Nal crystals. The lim- 
its from these searches can all be seen in Fig. One 
advantage of this technique is that their sensitivity is in- 
dependent of axion mass, as long as one can neglect any 
nuclear recoils Gd, Carosi, van Bibber (2007)]. 

Current results for solar axion searches can be seen in 
Fig. 
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FIG. 16: Exclusion plot of axion-photon coupling versus axion 
mass bl. The black bold line limit is for the phase one of the 
CAST experiment and results with inclusion of buffer gas are 
expected to push up in mass reach to plausible axion models. 
The field theoretic expectations are shown together with the 
string theory Zio.; model of fo}. In the lower left apricot 
box, Fig. [jis located. 


3. Geomagnetic Conversion 


It has recently been pointed out [123 Davoudiasl, Hu- 
ber (2006)] that solar axions might pass through Earth 
and convert to X-rays on the other side as they pass 
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FIG. 17: Outline of the general configuration of a resonant 
microwave cavity detector along with the associated singal ex- 
pected from axion-photon conversions. This diagram includes 
both the virial component as well as possible lines from co- 
herent axions. 


through the earth’s magnetic field. They could then be 
detected by X-ray telescopes and the solar X-ray back- 
ground could be effectively shielded by Earth. 


B. Search for cosmic axions 


Cosmic axions left over from the Big Bang may be de- 
tected utilizing microwave cavity haloscopes , 
Sikivie (1983, 1985)]. The strategy relies on primordial 
axions drifting through a microwave cavity immersed in a 
strong static magnetic field in which they can resonantly 
convert to microwave photons. The cosmic axions feeble 
interactions can be in part compensated by their large 
numbers; since the number density goes as ~ F? while 
their cross section goes as ~ 1/F?. If the axion makes up 
the majority of CDM in the universe, its local density is 
expected to be roughly 0.45 GeV/cm? Gates, Gyuk, 
Turner (1995)], which yields a number density of ~ 1014 
axions/cm? if one assumes a 4.5 pweV axion. The expected 
microwave signal will be a quasi-monochromatic line be- 
ginning at the microwave frequency corresponding to the 
axion mass and slightly broadened upward due to the ax- 
ion virial distribution, with expected velocities of order 
1078c implying a spread in energies of E/E ~ 10~°. 

There could also be an additional signal from non- 
thermalized axions falling into the galaxy’s gravitational 
well which would yield very ut signals due to their 
low predicted velocity dispersion (< 1077c ) B33, Sikivie 
(2003)]. 


1. General Detector Properties 


Since the Lagrangian for axions coupling to a magnetic 
field goes as 


Lavy = — aE.B, (103) 
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the only resonant modes which can couple to axions 
are those that provide an axial electric field component 
(transverse magnetic or TM modes). The expected power 
generated from axion-to-photon conversions in the cavity 
is given by [334 Sikivie (1985)] 


2 
m QemCayy 2 1 1 
P^ = ( F, ) V Bo PaCimn Ma mun (Qr, Qa) 


EE V Bo a C 2 
10-26 Do EN cn 
SHORT v (sx) (=) "i om) 
p —R 
0.5 x 10724 g cm-3 


* (zac) i (92.92 


where V is the cavity volume, Bo is the magnetic field 
strength, p is the local axion mass density, m, is the axion 
mass, Cymn is a form factor which describes the overlap 
of the axial electric and magnetic fields of a particular 
TMimn mode, Qz is the microwave cavities loaded qual- 
ity factor (defined as center frequency over bandwidth) 
and Q, is axion quality factor defined as the axion mass 
over the axion's kinetic energy spread. The mode depen- 
dent cavity form factor is defined as 


(104) 
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Cimn = E 
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(105) 


where E,(z)e/"' is the oscillating electric field of the 
TMimn mode, Bo(Z) is the static magnetic field and e 
is the dielectric constant of the cavity space. For a cylin- 
drical cavity with a homogeneous longitudinal B-field the 


T01o mode yields the largest form factor with C919 œ% 0.69 
id Bradley (2003)]. 


The mass range of cosmological axions is currently 
constrained between jjeV and meV scales which corre- 
sponds to converted photon frequencies between several 
hundred MHz and several hundred GHz. Since larger mi- 
crowave cavities correspond to lower resonant frequencies 
and lighter axions are more likely to contribute to the 
dark matter density experiments have been designed to 
start searching at the low end of the frequency range. At 
these frequencies cavities can only scan a few kHz at a 
time in order to maintain the maximum quality factor. 
Axial metallic and dielectric tuning rods are utilized to 
tune the cavities resonant frequency as it scans over the 
possible axion mass range. The scan rate is determined 
by the amount of time it takes for a possible axion signal 
to be detected over the microwave cavity’s intrinsic noise 
and is governed by the Dicke radiometer equation [23 
Dicke (1946)], 


avi = = 


SNR = 
— Bo 


(106) 


Here P, is the power generated by axion-photon conver- 


sions (Eq. (1029), Pv = kp BTs is the cavity noise power, 
B is the signal bandwidth, t is the integration time, kp is 
Boltzmann's constant and Ts is the system temperature 
(electronic plus physical temperature). The scan rate for 
a given signal to noise is given by 


df  12GHz( 4 V( V B (107) 
di -— yr SNR 5001 TT 
2 
re 4 p 
x C? (222) — ha 
0.72/ \ 0.45 GeV /em? 
, (3KY (_f \" Qu 
Ts GHz Qa 
One can see from Eq. (L0Q) that even a tiny expected 
signal power can be made detectable by either increasing 
the signal power (P, x V Bg), increasing the integration 
time t or minimizing the system noise temperature Ts. 
Technology and costs limit the size and strength of the 
external magnets and cavities and integration times are 
usually t ~ 100 seconds in order to scan an appreciable 
bandwidth in a reasonable amount of time. As a result 


the majority of development has focused on lowering the 
intrinsic noise of the first stage cyrogenic amplifiers. 


2. Microwave Receiver Detectors 


Initial experiments were undertaken at Brookhaven 
National Laboratory [2d DePanfilis et al. (Rochester- 
Brookhaven Collaboration, 1987)| and the University of 
Florida [185, Hagmann et al. (U. of Florida, 1990)], but 
their modest sized cavities and magnet fields meant they 
were still factors of 10-100 times away from plausible ax- 
ion model space. There are currenly two active second 
generation experiments underway, the Axion Dark Mat- 
ter eXpereiment (ADMX) at Lawrence Livermore Na- 
tional Laboratory (LLNL), USA and the Cosmic Ax- 
ion Research with Rydberg Atoms in Cavities at Kyoto 
(CARRACK) experiment in Japan. Both experiments 
utilize large microwave cavities immersed in a strong 
static magnetic field to resonantly convert axions to pho- 
tons but they go about detecting these photons in two 
different ways. ADMX uses ultra-sensitive microwave re- 
ceivers while CARRACK uses Rydberg atoms to detect 
single photons. 

The ADMX experiment is a collaboration of LLNL, 
MIT, the University of Florida, Lawrence Berkeley Na- 
tional Laboratory (LBNL), U.C. Berkeley, U. of Chicago 
and Fermilab, and has been operating in various modes 
since February, 1996. A diagram of the experiment can 
be seen in Fig. ADMX consists of an 8.5 Tesla super- 
conducting magnet, 110 cm in length with a 60 cm clear 
bore. A 200 liter stainless steel microwave cavity plated 
in ultra-pure copper is suspended below a cryogenic stage 
in the center of the B-field. Power generated in the cav- 
ity is coupled to an adjustable antenna vertically input 
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FIG. 18: Schematic of the ADMX experiment bd. 


through the top cavity plate. Any signal is then boosted 
by extremely low noise cryogenic amplifiers before being 
sent through a double-heterodyne mixing stage. Here the 
GHz range signal is mixed down to an intermediate 10.7 
MHz, sent through a crystal bandpass filter, and then 
mixed down to audio frequencies at 35 kHz. This audio 
signal is then analyzed by fast-Fourier-transform (FFT) 
electronics which measure over a 50 kHz bandwidth cen- 
tered at 35 kHz. There is also a “high resolution” chan- 
nel in which the signal is mixed down to 5 kHz and sent 
through a 6 kHz wide bandpass filter. Time traces of the 
voltage output, consisting of 22? data points, taken with 
a sampling frequency of 20 kHz is then taken, resultin 
in à 52.4 second sample with 0.019 Hz resolution [Lid 
Duffy et al. (2006)]. 

Since the system noise is dominated by the first stage of 
amplification great care was taken in choosing the cryo- 
genic amplifiers. The initial ADMX data runs utilized 
Heterojunction Field Effect Transistor (HFET) ampli- 
fiers developed by the National Radio Astronomy Obser- 
vatory (NRAO) [I2 Daw, Bradley (1997)]. Even though 
they had noise temperatures of only 2 K, the quantum 
noise limit at a GHz (defined as T; = hv/kg) is only 
50 mK. As a result a great deal of development went into 
replacing the HFETs with more sensitive Superconduct- 
ing Quantum Interference Devices (SQuIDs) which had 
noise temperatures of only 1596 the quantum limit Ld 
Bradley (2003)]. As of this writing data is being taken 
using the SQuIDs for the first stage of amplification. 

Results from the initial run using HFET amplifiers 
have already probed plausible axion model space in the 
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FIG. 19: Medium resolution limits from the ADMX experi- 
ment. The left-side plot shows limits to axion coupling assum- 
ing a dark matter halo density of p = 0.45 GeV/cm® (upper 
region excluded). The right-side plot shows limits on the ax- 
ion contribution to the dark matter halo density assuming the 
axion has either the KSVZ or the DFSZ couplings 
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FIG. 20: High resolution limits from the ADMX experiment. 
Limits given in terms of axion contribution to the dark matter 
halo density assuming the axion has either the KSVZ or the 
DFSZ coupling strength. The medium resolution plot for that 
mass range for the DFSZ coupling is also given as red 

The KSVZ axion with eg = 0 shown above gives pa at Earth 
less than 0.16 GeV /cm? which corresponds to Qa < 0.36. So 
the ADMX line of Fig. [f] using Eq. (95) crosses the eg = 0 
KSVZ line and goes down to the DFSZ line. 


axion mass range between 2.3 - 3.4 u eV ml Bradley 
(2003). Results from a high resolution search have 
probed further into coupling space over a smaller mass 
range, 1.98-2.17 ueV Duffy et al. (2006)]. As of 
this review ADMX is scanning over the mass range cor- 
responding to 800-900 MHz using SQuID amplifiers. 


3. Rydberg Atom Detectors 


The CARRACK experiment has published proof of 
concept papers for their detection technique using Ryd- 
berg atoms as opposed to low noise amplifiers , Tada 
et al. (2006)]. The experimental setup can be seen in 
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FIG. 21: General schematic of CARRACK experiment utiliz- 
ing Rydberg atoms to recover single photons generated in the 
microwave cavity lad, 


Fig. BI] In it rubidium atoms are excited into a Ryd- 
berg state (|0 >— |n >), and move through a detection 
cavity coupled to an axion conversion cavity. The spac- 
ing between energy levels is tuned to the appropriate fre- 
quency utilizing the Stark effect and the Rydberg atoms 
large dipole transition moment ensures efficient photon 
detection (one photon per atom, |ns >— |np >). The 
atoms are then subjected to a selective field ionization 
allowing the atoms with the higher energy state (|np >) 
to be detected lad, Carosi, van Bibber (2007)]. The ad- 
vantage of this system is that Rydberg atoms act as single 
photon detectors and thus do not suffer from quantum 
noise limitations. 

Though still in the development phase CARRACK has 
already gone through two iterations, CARRACK 1 and 
CARRACK 2, and it has measured cavity emission at 
2527 MHz down to a temperature of 67 mK, which is 
a factor of two below the quantum noise floor for that 
frequency. Their eventual goal is 10 mK , Tada et 
al. (2006)]. One disadvantage of this technique is that 
one cannot detect signals finer than the bandpass of the 
cavity, of order ~ 107, which negates searches for late 
infall coherent axions. 


C. Laser searches 


In addition to cosmological and solar axion searches 
there is also a class of laboratory axion searches which 
utilize laser photons (Miaser) traversing a magnetic field. 
Here the polarized laser photons can scatter off virtual 
photons (yv) provided by the magnetic field and con- 
vert into axions Yaser + ^, — a. Currently laser axion 
searches have fallen into two general categories. The first 
technique looks for magneto-optical effects of the vac- 
uum due to polarized laser photons disappearing from 


the beam as they are converted into axions. The second 
looks for photons converting into axions in the presence 
of a magnetic field, which are then transmitted through 
a wall and converted back into photons by a magnetic 
field on the other side, so called “light shining through 
walls” experiments. 


1. Polarization shift of laser beams 


There can be the axion-photon-photon anomalous cou- 
pling of the form aE-B. The laser induced axion-like par- 
ticle search employing this coupling has been performed 
since early 1990s by the Rochester-Brookhaven-Fermilab- 
Trieste (RBFT) group |56, Cameron et al. (1991)]. A few 
years ago, the same type experiment by the PVLAS col- 
laboration was performed with an initial positive signal 
with E, ~ 106 GeV B74, Zavattini et al. (2006)] we dis- 
cussed earlier. This has led to some exotic models where 
a vacuum dichroism is achieved by producing axion-like 
particles as shown in Fig. 22a). Because of the non- 
renormalizable interaction implied in Fig. 22{a), one ma, 
reconclie this model with the astrophysical bound 
Mohapatra, Nasri (2007)]. Or if light milli-charged par- 
ticles are produced in the strong magnetic field as shown 
in Fig. b) a vacuum dichroism is achieved as dis- 
cussed M 269, Gies, Jaeckel, Ringwald (2006), 
Masso, Redondo (2006), Kim (2007)]. Here, the polar- 
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FIG. 22: Possible processes leading to a vacuum dichroism. 


ization of the laser beam is looked for. With more data 
accumulation, there is no convincing evidence for an ax- 
ionlike particle with F, ~ 109 GeV at present, contrary 
to an earlier confusion kd, Dod [151] Ed Bzd Zavattini 
et al. (2007), Chou et al. (2008), Yoo (2007), Chen et 
al. (2006), Dupays et al. (2005)]. But, this incident has 
lead to the current search of axionlike particles at DESY 
, Ringwald (for the 4th Petras Workshop) (2008)]. 


2. Light shining through walls 


The “Light shinging through walls” technique for 
searching for axions was first proposed in 1987 by 
van Bibber et al. (1987)] and recently a model study 
has been presented 4, Adler, Gamboa, Mendéz, López- 
Sarrión (2008)]. The general experimental layout can 
be seen in Fig. where polarized laser photons pass 
through the magnetic field with E||B and any converted 
axions (or other psuedoscalar particles) can continue 
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FIG. 23: Schemtic of “light shining through walls” experiment 


through an absorber to be re-converted to photons on 
the other side. 

'The probability for a photon to convert into an axion 
as it traverses the “Axion Source" region is given by 


2 
Bof gli gy, | Posen 
2n FE, (qL)? 


4 
This is the same probability for an axion to convert back 
into a detectable photon in the *Axion Detector" region 
on the otherside of an absorbor, which leaves the total 
probability for detecting a photon-axion-photon conver- 
sion as Py_.q, = P2 ,, (ignoring photon detection ef 
ficiencies of course) Battesti et al. (2007). There 
is a maximum detectable axion mass for these laser ex- 
periments as a result of the oscillation length becoming 
shorter than the magnetic field length causing a degra- 
dation of the form factor F(q) = 1 — cos(qL)/(qL)? but 
this can be compensated for by multiple discrete dipoles. 


(108) 


The first experiment using this technique was per- 
formed by a collaboration consisting of Rochester- 
Brookhaven-Fermilab-Trieste (RBFT) in early 1990s 51, 
Cameron et. al. (1993)]. Using two superconducting 
dipole magnets (L = 4.4 m and B = 3.7 T) and a laser 
(A = 514 nm and P = 3 W) with an optical cavity pro- 
viding ~ 200 reflections in the axion generating region 
they were able to set upper limits on axion couplings of 
Jayy < 6.1 x 1077 GeV (95% C.L.) for pseudoscalars with 
a maximum mass of Ma < 107? eV |5], Cameron et al. 
(1993)]. 

Recent photon regeneration experiments include the 
BMV collaboration at LULI , Robilliard et al. 
(2007)] which uses a short pulsed-field magnet and the 
GammeV collaboration at Fermilab fod, Chou et al. 
(2008)] which uses a Tevatron dipole magnet (L — 6 m 
and B — 5 T) with an optical barrier in the middle. Both 
of which have ruled out the signal reported by the PVLAS 
(see next section). Fig. ZJillustrates the current bounds 
from these latest regeneration experiments. Recently it 
has been shown that photon regeneration experiments 
can be resonantly enhanced by encompassing both the 
production and reconversion magnets in matched Fabry- 
Perot optical resonators B34 Sikivie, Tanner, van Bibber 
(2007)]. 
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FIG. 24: Current limits on axion coupling from the GammeV 
collaboration [lod B73). 
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FIG. 25: The upper figure shows the effect of dichroism as 
photons converting into axions cause a rotation of the linear 
beams polarization vector by an amount e. The lower figure 
shows virtual axions inducing birefringence in which the linear 
beam acquires ellipticity V L3 


3. Magneto-Optical Vacuum. Effects 


An alternative to the “shining light through walls” 
technique is to look for the indirect effect of photons in 
polarized laser light converting into axions as the beam 
traverses a magnetic field. Fig. illustrates the two 
different ways in which axion interactions can modify 
a polarized laser beam, induced dichroism and vacuum 
birefringence. Vacuum dichroism occurs when a polar- 
ized laser beam passes through a dipole magnet with the 
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electric field component E at a non-zero angle @ rela- 
tive to the B. The photon component parallel to the 
B will have a small probability to convert into axions 
causing the polarization vector to rotate by an angle e. 
Vacuum birefringence is due to the induced ellipticity of 
the beam (V) as a result of virtual axions. It should be 
noted that higher order QED diagrams, or “light-by-light 
scattering" diagrams, are expected to contribute to vac- 
uum birefringence as well. Each of these effects can be 
estimated as 


B2L3m2 a 2 
Ue n= (=) sin(20), (109) 
W a 
BL asd 
ex N 64 (22) sin(20), (110) 


in the limit that m2L/4w « 1. Here L is the effective 
path lengths, N is the number of paths the light travels in 
the magnetic field, m, is the axion mass, w is the photon 
energy and @ is the photon polarization relative to the 
magnetic field BY, Battesti et al. (2007)]. 


The initial experiment looking for magneto-optical vac- 
uum effects was carried out by the RBFT Collaboration 
in the early 1990s B29, Semertzidis et. al. (1990)]. This 
experiment used a single-pass length 8.8 m magnet with 
a magnetic field of B — 2.1 T and N — 500. It set a limit 
on the polarization rotation of e < 3.5 x 1071? which was 
still 3 orders of magnitude higher than that expected by 
light-by-light scattering and almost 15 orders of magni- 
tude greater than a m, ~ 1073 eV axion. 


Recently the early PVLAS collaboration reported the 
positive detection of vacuum dichroism. This experiment 
consists of a 1 m long 5 T superconducting magnet with 
a angular frequency Qmag of the magnet rotation and a 
6.4 m long Fabry-Perot cavity giving the pass number 
N = 20mag/7 ~ 44,000. It registered a polarization 
shift of 


e = (3.9 + 0.5) x 10^ Prad pass! 


(111) 


which translates to an allowed mass range of a neu- 
tral pseudoscalar boson of 1 meV< my <1.5 meV and 
a coupling strength of 1.5 x 107?GeV^! < cayy/Fa < 
8.6 x 10-3GeV~! [37d, Zavattini et al. (PVLAS Collabo- 
ration) (2006)]. Though the report of this positive signal 
has been retracted : Zavattini et al. (PVLAS 
Collaboration) (2007, 2008)], the interest it raised has 
lead to a number of more advanced experimental searches 
such as some of the new laser regeneration experiments 
mentioned previously. 
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VI. THEORIES FOR VERY LIGHT AXIONS 


Axions from 


String theory 
M-theory 


Order of Fa 


String scale or Planck scale 


String or the scale of the 11th dimension 


Large extra (n) dimension|Combination of the fundamental mass Mp and extra dimension radius R 


Composite models 


Renormalizable theories 


TABLE I: Natural scales of Fa. For n large extra dimension, the Planck mass is Mp ~ 


The axion couplings come in three types: the PQ sym- 
metry preserving derivative coupling cı term, the PQ 
symmetric c9 term, and the anomalous c3 term. The 
PQ symmetry gives a gluon anomaly and cg + c3 must 
be nonzero. Generally, we can therefore define a as a 
pseudoscalar field without potential terms except the one 
arising from the gluon anomaly under certain basis (for 
example in the c9 = 0 basis), 


a 6 Gay. 


Fa ed m 


Then, we note that this kind of nonrenormalizable 
anomalous term can arise in several ways. The natu- 
ral scales of Fa are shown in Table [Ī In any case, the 
essence of the axion solution (wherever it originates in 
Table [ is that the axion VEV (a) seeks 0 = 0 what- 
ever happened before. In this sense it is a cosmological 
solution. The potential arising from the anomaly term 
after integrating out the gluon field is the axion potential 
(with c2 + c3) sketched in Fig. [J 


A. SM singlets without SUSY 


A complex SM singlet carrying the PQ charge can ap- 
pear in many extensions of the SM: in grand unified theo- 
ries(GUTs) Wise, Georgi, Glashow (1981)], in com- 
posite models Kim (1985), Choi, Kim (1985), 
Babu, Choi, Pati, Zhang (1994)], and in models with ex- 
tra dimensions , Di Lella, Pilaftsis, Raffelt, Zioutas 
(2000)]. 

In the SU(5) GUT, the axion can be embedded in a 
complex 24 — X Bes, Wise, Georgi, Glashow (1981)], 
in which case the VEV (X) breaking SU(5) down to the 
SM and hence the axion decay constant is the GUT scale 
and is outside the axion window. On the other hand, 
a complex GUT singlet, whose VEV is not related to 
the GUT scale, can house the axion within the axion 


Compositeness scale 


The U(1)pq global symmetry breaking scale 


Mp(R/Mp)"”?. 


m 


window. The SUSY generalization of the SU(5) GUT 
axion has been shown to be possible ps4, Nilles, Raby 
(1982)]. Recently, the electrophilic axion in view of the 
white dwarf evolution Bog, Isern, García-Berro, Tos, 
Catalán (2008)] with the two dark matter scenario 
Huh, Kim, Kyae (2008)] has been suggested in a SUSY 
flipped SU(5 ) Ed Bae, Huh, Kim, Kyae, Viollier (2008)]. 


B. Composite axions 


A SM singlet for the very light axion can arise as a com- 
posite meson with an extra confining force whose scale is 
much larger than the electroweak scale. This confining 
force can be the hidden sector gauge group in supergrav- 
ity or just an extra gauge group. Let us call this extra 
confining gauge group ‘axi-color’ SU(N). To create the 
QCD axion below the axi-color scale, there must be two 
classically conserved axial global symmetries foil 
Kim (1985), Choi, Kim (1985)]. With only one axial 
symmetry, there would not result a massless meson, as 
in the case of one flavor QCD there is no massless meson 
since the only meson 7’ becomes heavy by the instan- 
ton solution of the so-called U(1) problem , t Hooft 
(1976)]. For two axial symmetries, we can consider two 
kinds of axi-quarks, Q^^, Q 4a, q^, and 44 where A is the 
SU(N) index and a is the SU(3), index. For these vector- 
like representations, (N, 3) + (N, 3) + (N, 1) - (N, 1) un- 
der SU(N)xSU(3)., mass terms are not introduced. The 
axi-color vacuum angle problem is solved basically by the 
massless axi-quarks, Q and q. Even though Q looks like 
a massless QCD quark, it cannot be considered as the 
massless quark solution of the QCD 0 problem. After 
integrating out the axi-color degrees, we obtain an effec- 
tive Lagrangian resulting from Q and q. The axi-baryons 
are expected to be removed at the axi-color scale. Out of 
two kinds of mesons, one (the axi-color 7’) is removed at 
the axi-color scale and the other remains exactly mass- 
less. However, this massless axi-color meson couples to 


the QCD anomaly and becomes a QCD axion through 
the c3 term, becoming the so-called hadronic axion. Out 
of the two currents, 


—5 — E 
Ja = Qus + TYu^rsd 


- g (113) 
Ja = QWB — 397759; 


the divergence of d corresponds to the massless meson 
a below the axi-color scale, 


2N E 
D e. [2] auv 
o" Ja = 3252 04,6 pu. (114) 
and hence we obtain the effective interaction . In 


this minimal model, the domain wall number is N {91i 192, 
Choi, Kim (1985)]. In a supergravity model of preons, a 
similar mechanism was used to realize a composite axion 

, Babu, Choi, Pati, Zhang (1994)], where the role of 
q type matter is replaced by the meta-color gluino A 
where meta-color is the binding force of preons. Even if 
the meta-color gluino obtains a mass of O(100 GeV), the 
QCD 0 can be made within the experimental bound if Fa 
is greater than 10!! GeV. 

The composite axion of [103, Chun, Kim, Nilles (1992)] 
is a composite made of hidden-color scalars whose bilin- 
ears develop VEVs and break the PQ symmetry. This 
idea has been made more concrete in , Kim, Nilles 
(2009)]. 

In the gauge mediated SUSY breaking scenario a la 
Intrilligator, Seiberg and Shih (ISS) , Intrilligator, 
Seiberg, Shih (2006)], for example, an SU(N.) confining 
group with Ny flavors satisfying Ne +1 < Ny < Ne 
allows a SUSY breaking local minimum. If N; — Ne > 
3 (for example Ne = 7 and Ny = 10) with one type 
Q^* + Qaa and Ny — 3 flavors of the type q^ + 95, 
then there can exists a suitable local minimum where the 
composite axion envisioned in Eq. can be realized. 
In this case, the SUSY breaking scale and the composite 
axion scale are related as first tried in 225, Kim (1984)]. 


C. Axions with extra dimensions 


With large extra dimensions, the axion identification 
involves a few parameters: the fundamental scale mass 
Mr, the Kaluza-Klein (KK) radius R and the number of 
extra dimensions n. In addition, there are several ways 
to allocate the field(s) containing the axion in the bulk 
and/or branes. 

The possibility of large extra dimensions have been 
considered for the flat and warped extra dimensions. The 
TeV scale for Mp was the main motivation to look for the 
next level of the current experimental limit on the mm 
scale gravity fd, , Arkani-Hamed, Dimopoulos, Dvali 
(1998), Antoniadis, Arkani-Hamed, Dimopoulos, Dvali 
(1998)]. Because the axion scale is considered to be at 
the intermediate scale, a string theory at the intermediate 
scale Mp has been considered also , Burgess, Ibanez, 
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Quevedo (1999)]. With the Randall-Sundrum type warp 
factor , Randall, Sundrum (1999)], it is possible to 
introduce the intermediate scale with a Planck scale Mp 
via the Giddings-Kachru-Polchinski stabilization mecha- 
nism [172, Giddings, Kachru, Polchinski (2002). 

Here, we look only at the possibility of TeV scale Mp. 
Since the Planck mass is given by Mp = My(RMy)"/?, 
we obtain n > 2 for Mp ~ 10 TeV isg, Hannestad, 
Raffelt (2002), Kanti (2008)]. The Lagrangian in (4 + 
n)—dimensions ((4+ n)D) with a bulk field axion can be 
written as (Z9, Chang, Tezawa, Yamaguchi (2000)], 


Bae dyf 1 Mg [ð aða + dyad¥a| 


p Seem aig Fee) (115) 


"T UVpQ HY 
where Upo is the PQ symmetry breaking scale at the 
fundamental scale order Mp, and 


alz"; y) = 3 ay (x^) cos (=) : (116) 


n=0 


The 4-dimensional PQ symmetry breaking scale is Fy ~ 
(Mp/My)4/"Tpq where A = |n| = Yn +---+n2 <n 
and F, falls between Upo and Mp. The very light axion 
is the n = 0 component in Eq. (L1G), and the rest are the 
KK axions. The mass splitting of the KK axions are of 
order 1/R, and the phenomenology of these KK axions 
for agg — 2y has been nicely studied in BI Di Lella, 
Pilaftsis, Raffelt, Zioutas (2000)], from which we have 
1/R ~ 1(10) eV for n = 2(3) for Mp ~ 1 TeV. 

The possibility of a Z2 odd 5D gauge field in a warped 
fifth dimension has been suggested for a QCD axion 
under the assumption that all unwanted PQ symmetry 
breaking effects are suppressed |83, Choi (2004)]. One 
such constraint is that the bulk fields carry the vanishing 
PQ charge. 


D. SUSY breaking scale, axion and axino 


The 4D supergravity interactions with the vanishing 
cosmological constant was written in 1983 , Crem- 
mer, Ferrara, Girardello, van Préyen (1983)]. The PQ 
symmetry can be embedded in the supergravity frame- 
work , Kim (1984)], 


Wee = (f5.A142 = FZZ + (feA1A2 = F2)Z' 


d (117) 
tfo41Q1Q» 


where Z,Z’, A and A» are gauge singlet chiral fields, 
Q, and Q» are chiral quark superfields, fs, fe, F2, and 
F2 are parameters. The superpotential (IZ leads to 
the F-term SUSY breaking and the PQ symmetry break- 
ing at a common scale at order O(F2, F2) if f5/f. £ 
F?/F3. The fg term defines the PQ charge of the 
heavy quark and the resulting axion is the KSVZ type. 


The PQ symmetry breaking scale is given by nonzero 
(A143) ~ (F?/X)cos(8 — a) and the SUSY breaking 
scale is given by nonzero Zp = —F? sinasin(f — a) and 
Zi, = F?sinosin(f — a) where \ = yf? + f2, F? = 

Ff + Fj tana = f./fs and tanB = F2/F2. The ax- 
ino does not obtain mass at this level, but obtains a 
mass at order the soft SUSY breaking scale fod, 
Chun, Kim, Nilles (1992), Chun, Lukas (1995)]. Note 
that (Z'/Z) ~ — coto with (Z, Z’) = O(F?/M). The 
early discussions on axino can be found in s Frere, 
Gerard (1983)]. 


E. The u problem 


If Higgs doublets Hua carry vanishing U(1) charges be- 
yond the MSSM gauge charges, then the superpotential 
can contain W, = —u HH term where u can be of order 
the fundamental scale since it is a supersymmetric term. 
'This is problematic for the TeV scale electroweak sym- 
metry breaking, which is the so-called u problem 
Kim, Nilles (1984). This jj term is a supersymmetric 
Higgsino mass term and can be forbidden in W by in- 
troducing some symmetry, continuous or discrete. The 
widely discussed ones are the PQ and R symmetries. In 
the supergravity framework, if the Higgs doublets carry 
one unit of PQ charge then the nonrenormalizable inter- 
actions of the form S? H,, Ha/Mp can be present in W if 
S? and H,,H4 carry the opposite PQ charges. Then, the 
resulting p is of order F2/M p which can be of order of the 
TeV scale [239, Kim, Nilles (1984)]. With an intermedi- 
ate hidden sector with hidden sector squarks Q4 and Qo, 
one may have a nonrenormalizable interaction of the form 
Q1Q, H,Ha/Mp. In this case also, the hidden sector 
squark condensation at the intermediate mass scale can 
generate a TeV scale u , Chun, Kim, Nilles (1992)]. 
[For the B, term, one may consider (Q13/M2)H, Ha 
in the Kahler potential] The superpotential is better 
to possess this kind of PQ symmetry and/or R symme- 
try 6g, , , ; , Dine, MacIntire (1992), Hall, 
Randall (1991), Casas, Munoz (1993), Kim, Nilles (1994), 
Kim(1999a)]. If so, even if the nonrenormalizable inter- 
actions are not considered, the gravity mediation scenario 
can generate a TeV scale u via the Giudice-Masiero mech- 
anism [zd Giudice, Masiero (1988)]. In supergravity, 
the Higgsino mass term is present in the chiral fermion 
mass matrix given by , , Cremmer, Ferrara, Gi- 
rardello, van Pröyen (1983), Nilles (1984)] 

TCG” — G'G? — G'(G7')EGY]xuixus (118) 
where G = K(¢,¢*) —In|W|?. The term e-9G gives 
u ~ M3/2 if K contains H,Hq Giudice, Masiero 
(1988)] and u ~ 5?/Mp if W contains S? H,H4/Mp [23d, 
Kim, Nilles (1984)]. 

In the next-to-MSSM(NMMSM) models with W = 
SH,H,, the p term can be generated by the singlet 
VEV at the electroweak scale Ed bod Cerdeno, Hugonie, 
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López-Fogliani, Muñoz, Teixeira (2004), López-Fogliani, 
Munoz (2006)]. In a Z'-added MSSM(Z'MSSM,, the u 
term can be successfully generated also 54) Langacker, 
Paz, Wang, Yavin (2008)]. 

Extending the MSSM gauge group which can be bro- 
ken down to the MSSM at a high energy scale, one can 
generate a reasonable u. For example, there exists an in- 
teresting solution to the problem, “Why is there only one 
pair of Higgsino doublets at low energy?”, in the extended 
electroweak model B54 Lee, Weinberg (1977)]. This is 
dictated by the extended gauge symmetry. This one pair 
problem is elegantly solved in the SUSY Lee-Weinberg 
type model due to the antisymmetric Higgsino mass ma- 
trix , Kim (2007)], reminiscent of the “color” intro- 
duction to put the low-lying baryons in the completely 
symmetric representation 56 in the old flavor-spin SU(6) 

Han, Nambu (1965)]. 

Thus, explicit models toward a successful jz in the grav- 
ity mediation scenario can be constructed in extra singlet 
models, in SUSY-GUT models, through the superpoten- 
tial, through the Kahler potential, and in composite mod- 
els. 

The loop effects are the important sources of SUSY 
breaking in the gau d o3 (ca SUSY breaking scenario 
(GMSB) [132 fad Ds fod [ra L39, Dine, Nelson (1993), 
Dine, Nelson, Shirman ( imos Dine, Fischler, Srednicki 
(1981), Dimopoulos, Raby (1981), Dine, Fischler 98 
in anomaly mediation SUSY breaking ( (AMSB) ps Ed 
Randall, Sundrum (1999), Giudice, Luty, E 
Rattazzi (1999)], and even in the mirage mediation sce- 
nario B4, Choi, Jeong, Okumura (2005), Loaiza- 
Brito, Martin, Nilles, Ratz (2005). The GMSB has 
been suggested to solve the flavor problem , Gab- 
biani, Gabrielli, Masiero, Silvestrini (1996)], present in 
the gravity mediation scenario. In this GMSB or any 
other loop generated SUSY breaking scenarios, the soft 
terms generated by supergravity effect are required to be 
subdominant compared to those arising from the loops 
or at best comparable to it. If it is sub-dominant as in 
the GMSB or AMSB, then there are some problems. 

Firstly, the generation of u is difficult because u term 
generation via the Giudice-Masiero mechanism is sub- 
dominant at the TeV scale. One has to generate u em- 
ploying the PQ and/or R symmetries, which however 
does not belong to the grand design of generating all TeV 
scale parameters dynamically. In this regard, another 
confining group around TeV scale has been proposed [8&, 
Choi, Kim (2000)], and the model presented there be- 
longs to a composite SU(2)w axion discussed in Subsec. 
[VIB] which was saved by introducing singlets and rele- 
vant couplings 264, Luty, Terning, Grant (2001)]. Then, 
again it does not succeed in generating all TeV scale pa- 
rameters dynamically. 

Second, in the loop SUSY breaking scenarios for gen- 
erating all TeV scale electroweak parameters by loops, 
there exists the B,,/j problem , Dvali, Giudice, Po- 
marol (1996)]. Since it occurs at loop orders, we consider 
f 440 H, H4X! for p and f d*0H,HaX X! for B, where 


the auxiliary component of X develops a VEV. From 
this observation, one generically obtains B,, ~ uA where 
A ~ p/f? can be greater than u which was remedied 
by making B, appear at a two loop order [54 Dvali, 
Giudice, Pomarol (1996) This B,,/j problem occurs 
basically from the difference of the engineering dimen- 
sions of the B, and u terms. Both generically appear at 
one loop order with the coefficient g?/1677, and hence 
in describing the electroweak scale B, term lacks one 
power of g?/167?. Recently, a better solution employing 
a Kàáhler potential H, Hj(In X + ln X!) has been sug- 
gested , Giudice, Kim, Rattazzi (2008)], which can 
be compared to the original Giudice-Masiero Kahler po- 
tential H, Ha X! 4----. There exist several more ideas to- 
ward the B,,/u problem Ed Dod Cohen, Roy, 
Schmaltz (2007), Murayama, Nomura, Poland (2008), 
Roy, Schmaltz (2008), Cho (2008)] 

Maybe, the nonrenormalizable interactions are the 
easy solution of the u and B,/p problems even in the 
GMSB also. Here however one introduces another scale. 
Without a detail knowledge on the ultra-violet comple- 
tion of the MSSM, the nonrenormalizable interactions are 
usually assumed to be suppressed by the Planck mass 
Mp. But, there can be some heavy mass scale M, which 
can be somewhat smaller than the Planck mass Mp, for 
the seesaw mass of the required nonrenormalizable inter- 
actions. In string compactifications, it is known explic- 
itly that M can be different from Mp b3, lod, bad Choi, 
Kim (2006), Kim, Kim, Kyae (2007), Choi, Kobayashi 
(2008)]. A simple diagram giving an M dependence is 
shown in Fig. [2x] where SU(2) doublets Hı and Hə forms 
a vectorlike superheavy pair. This is a kind of the seesaw 
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FIG. 26: The generation of the u term by a seesaw mecha- 
nism. 


mechanism of Higgsino doublet pairs. For this scenario, a 
superpotential possessing the PQ symmetry can be con- 
structed 


W = imX? + fx X? + 49°T + XH Ho 
—fiS Hı Hu — foSH2Ha+--: 


(119) 


where (X) = M and H,,H4 term is forbidden by the PQ 
symmetry. Then, the u term for H,, and H4 (which give 
mass to up and down type quarks, respectively) is given 
by u = (AfaS?/M) bad Kim, Nilles (1984)]. If (S) 
is lowered to the hidden sector confining scale of order 
~ 101071? GeV in the GMSB, the Higgsino mass can be 
made around the TeV scale by adjusting fi f2/M. One 
may construct models with appropriate F-terms such 
that B, and m2, are of the same order in the GMSB, 
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e.g. through the PQ symmetry preserving term in the 
Kahler potential 


pot na + h.c. (120) 


which also gives a u term. 

In the so-called “mixed mediation(M-mediation) sce- 
nario”, with comparable moduli, anomaly and gauge me- 
diations, that includes in its parameter space the GMSB, 
the AMSB, the mirage mediation, and the deflected 
mirage mediation , Everett, Kim, Ouyang, Zurek 
(2008)], the loop generated u term in general has a se- 
vere B„/u problem. It seems that the model presented in 
an AMSB scenario [314, Pomarol, Rattazzi (1999), 
Rattazzi, Strumia, Wells (2000)] has the basic ingredient 
for the solution of B,,/u problem according to a PQ sym- 
metry as stressed in Giudice, Kim, Rattazzi (2008)]. 
This can be gleaned also from the axion shift symmetry 
in the mirage mediation scenario , Nakamura, Oku- 
mura, Yamaguchi (2008)]. 


F. Axions from superstring 


The most interesting theory housing axions is super- 
string theory. Axions from string are described by ef- 
fective field theory below the compactification scale. If 
it arises from the spontaneous symmetry breaking of a 
tree-level global symmetry as we discussed so far, the 
answer is simple: There is no such axion since string the- 
ory would not allow any global symmetry. If the com- 
pactification process leads to the SM, the renormalizable 
terms in this effective theory respect the gauge symmetry 
SU(3)-xSU(2)xU(1)y and the global symmetries: the 
baryion number U(1)g and the separate lepton numbers 
U(1)r; On the other hand, if the nonrenormalizable 
terms are allowed, one can write for example qLlLURdR, 
breaking both the baryon number and the lepton number. 
Including the nonrenormalizable terms, thus the SM does 
not respect the baryon and lepton numbers. Similarly, 
there is no PQ global symmetry if we are allowed to write 
all nonrenormalizable terms. For the PQ symmetry, the 
situation is more severe. Suppose the singlet carrying 
the PQ charge is c. Then, o*o respects the PQ sym- 
metry but o? and o? do not respect the PQ symmetry, 
which has led to the discussion on the gravitational ef- 
fects on the axion |30, , l , Kamionkowski, 
March-Russell (1992), Barr, Seckel (1992), Holman, Hsu, 
Kephart, Kolb, Watkins, Widrow (1992), Ghigna, Lusig- 
noli, Roncadelli (1992), Dobrescu (1997)]. Therefore, the 
PQ symmetry cannot be discussed in general terms in 
terms of matter fields only, when we include gravity in 
the discussion such as in string theory. 

Thus, in string compactification one must consider the 
gravity multiplet also. Here, the gauge singlet bosonic de- 
grees in the gravity multiplet are graviton gyn (M, N = 
0,1,---,9), antisymmetric tensor Byyy and dilaton 6. 


In 10D, gmn and Bmyn are gauge fields. A 4D action on 
Minkowski space z^"(u = 0,1,2,3) is obtained by com- 
pactifying six internal spaces y'(i = 4,--- ,9) with the 
compact volume Vz. Some bosonic degrees from the 10D 
antisymmetric tensor field behave like pseudoscalars in 
the 4D effective theory. Thus, the axion candidates, if 
they are not arising from the matter multiplets, must be 
in Byn. The pseudoscalar fields in By are like phase 
fields in axion models in field theory. Because there is 
no global symmetry in string theory, there must be no 
massless Bmy, otherwise the shift symmetry of Byrn 
must have worked as a global symmetry. From the tree- 
level equations of motion all pseudoscalar By fields are 
not massless. For example, if a shift symmetry of BMN 
is related to an anomaly as in the PQ current case, we 
consider that the shift symmetry is already broken. In 
other words, there is no shift symmetry of pseudoscalar 
Byrn if it is not anomalous. 

One must deal with these bosonic degrees in string 
compactification to see whether these components do 
not lead to terms in the potential (or the superpotential 
in SUSY models), which is very technical and model- 
dependent procedure. Here, we briefly discuss the story 
on axions from string and comment on its phenomeno- 
logical viability. Some relevant recent reviews describing 
details can be found in fod Bad Svrcek, Witten (2006), 
Conlon (2006)]. The M-theory discussion was presented 
in [B]. Choi (1997), Svrcek, Witten (2006)]. 

The pseudoscalar fields in Bw come in two cate- 
gories, one the tangential component B,, and the other 
Bij. By, can be discussed in any string compactifica- 
tion and hence is called *model-independent'(MI) while 
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where Try is the trace over vector representation and the 
Chern-Simons 3-form is 


w3 = Tr, (4 MdA: +A; ^ A; ^A). (122) 


For EgxEs, there is the adjoint representation and we 
use 35 Tra in place of Tr,. For the compact internal vol- 
ume Vz, the Planck mass is Mp = 41Vz/g2(5 and the 4D 
gauge coupling constant is Don = Ang?l®/Vz or ayy = 
g2°/Vz. In most compactifications, the SM gauge fields 
arise from the level k = 1 embedding and the coupling 
aym is the coupling strength at the compactification 
scale. If the SM gauge fields are embedded in the level 


M$ 
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Bij depends on the compactification scheme as its in- 
ternal coordinates 7 and j imply and is called ‘model- 
dependent’ (MD). After presenting the string formulae 
containing Bw, we will discuss the MI axion in Sub- 
subsec. [VIE 1] and then the MD axions present in much 
more speculative models in [VIF 7] One can go directly 
to Subsubsec. [VIF T] if not interested in the details on 
the origin of the couplings. 

Now, there exists the standard formulae for the string 
action , Polchinski (1988), Eq. (13.3.22)], which was 
lacking in the early days of string axions [89, , Wit- 
ten (1984), Choi, Kim (1985)]. The Type II dilaton órr; 
and coupling gr; = €?'! are related to the 10D gravita- 
tional coupling Kio by M8, = 1/k29 = 4n/g2,05 where 
a’ = (?/(21)?. For the Type II string, there are NS- 
NS and R-R fluxes which can give anomalous couplings. 
These complicated system housing pseudoscalars is re- 
viewed in , Conlon (2006)] with a possible conclusion 
of the difficulty of realizing a QCD axion in string mod- 
els with a workable moduli stabilization Bid, Kachru, 
Kallosh, Linde, Trivedi (2003)]. In heterotic string mod- 
els, there does not exist a reasonable moduli stabilization 
mechanism, even though an ambitious attempt has been 
proposed |83, Becker, Becker, Fu, Tseng, Yau (2006)]. 
We however discuss heterotic string axions below because 
string axions were found first in heterotic string mod- 
els and key couplings on axion phenomenology might be 
similarly discussed also for the Type II string. As in the 
Type II string, the heterotic string coupling is related 
to the dilaton ® as ga = e?. The kinetic energy terms 
of gun, Bun and Am are 298, Polchinski (1988), Eqs. 
(12.1.39) and (12.3.36)] 
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k, the SM gauge coupling at the compactification scale 
will be smaller by the factor k. For interactions of Byrn, 
we consider the Bianchi identity, the gauge invariant cou- 
plings of gaugino x l , Derendinger, Ibanez, Nilles 
(1985), Dine, Rohm, Seiberg, Witten (1985)], and the 
Green-Schwarz terms , Green, Schwarz (1984)] 
dT 


162? 


(trR A R—trF^ F), HmynpX "tU. 
BAtrFAFAFAF+-::: (123) 


where Hy p is the field strength of Bumn, F is the field 
strength of the gauge field A, the gauge invariant fermion 
coupling is the SUSY counterpart of the relevant terms of 
Eq. (L21], and --- denotes more Green-Schwarz terms. 


It was argued that the Hmyp coupling to the gaugino 
must be a perfect square [140, Dine, Rohm, Seiberg, Wit- 
ten (1985)], which gives a vanishing cosmological con- 
stant even for a nonvanishing gaugino condensation with 
nonzero (Hyp) [24 [14d, Derendinger, Ibanez, Nilles 
(1985), Dine, Rohm, Seiberg, Witten (1985)]. 


1. Model-independent axion 


D,, with u and v tangent to the 4D Minkowski space- 
time is the MI axion present in all string compactifica- 
tions Bi Witten (1984)]. Because it is a 4D gauge bo- 
son, one cannot write potential terms in terms of By, 
and it is massless if one neglects the anomaly term. 
The number of transverse degrees in B,, is one and 
can be expressed as a pseudoscalar a by dualizing it, 
Hyp X Fa€urgo0^a. Even though it is massless at this 
level, the Bianchi identity of gives an equation of 
motion of a as 0?a = (1/327? Fa)G$, G°!” which hints 
that a might be an axion. For it to be really a QCD 
axion, C2 + c3 should be nonzero as discussed in Subsec. 
[TB] It is known that c = 1 B73, Witten (1985)] with 
cg defined in Eq. (19). The other possible couplings are 
given by the second term of 


Fa Fa 2 

ME, Cio aM IXP x E Mg, Ce eX aui (124) 
which is the cı term defined in Eq. (9). There is no 
cg term, and c2 + c3 = 1 and hence H,,, is really an 
axion and is model-independent.^ This is a hadronic ax- 
ion. This MI hadronic axion can have a nonvanishing 
c; and hence its phenomenology can be different from 
that of the KSVZ hadronic axion. So, in Eqs. (61862) 
for the MI hadronic axion, one has to add the relevant 
c, term from Eq. (224). The domain wall number of 
the MI axion has been shown to be Npw = 1 by con- 
sidering the coupling of the MI axion to string X M (o, 7) 
on the world sheet f d?ec^? B,,0, X"0g X" , Witten 
(1985). F, is about 1078 times the Planck mass [89, 
Choi, Kim (1985a)], and the correct relation, obtained 
from (Z), is Fa/k = o-Mp/2?/?x ~ 1019 GeV where 
k is the level of the SM embedding and a, is the QCD 
coupling constant [343, Svrcek, Witten (2006)]. But the 
value F, ~ 1016 GeV most probably overcloses the uni- 
verse. 

An idea for lowering the MI axion decay constant may 
be the following. In some compactification schemes, an 
anomalous U(1)an gauge symmetry results, where the 
U(1)an gauge boson eats the MI axion so that the U(1)an 
gauge boson becomes heavy. This applies to the MI 
axion since the coupling O"aw; A" is present by the 


^ Nevertheless, its property may depend on models in warped 
space , Dasgupta, Firouzjahi, Gwyn (2008)]. 
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Green-Schwarz term [o3 Bz Witten (1984), Chun, 
Kim, Nilles (1992)]. In fact, even before considering 
this anomalous U(1)a, gauge boson, the possibility was 
pointed out by Barr |29, Barr (1985)], which became a 
consistent theory after discovering the anomalous U(1)an 
from string compactification id, tad [43 Dine, Seiberg, 
Witten (1987), Atick, Sen (1987), Dine, Ichinose, Seiberg 
(1987). Then, a global symmetry survives down the 
anomalous U(1)an gauge boson scale. A detailed scenario 
is the following. The way anomalous U(1)an with the 
gauge transformation, 04, — 9an+constant, is obtained is 
by calculating U(1)an charges of fermions. Thus, we have 
a non-vanishing c; in Eq. (I9) as mv; vg exp(ica0,,) and 
c3 of the MI axion as c30ur{ FF}. For all gauge group 
factors, the anomaly units are calculated and they are 
shown to be identical ba, , Casas, Kathehou, Munoz 
(1988), Kim (1988)]. For the MI axion to be the part of 
a gauge boson, it must be a true Goldstone boson with- 
out an anomaly, i.e. it should be exactly massless; so let 
us transform away the c3 term by a phase redefinition of 
fermions such that @ = c3 — c3(0 m1) /044 and c = 0 can 
be made for all gauge fields, i.e. ay;; coupling to anoma- 
lies vanishes for all gauge groups. Because the longitudi- 
nal gauge boson amz is removed away, we are left with 
the c term only, mv; Vg exp(ica04,) + h.c, without the 
need to consider the gauge symmetry U(1)an. At low en- 
ergy, however, the term mV yg exp(i@29an) has a global 
symmetry, 04, — 044 + constant, with 044 not depending 
on z^". Thus, the interaction muU rUR exp(?@20an) + h.c. 
explicitly shows a global U(1) axial symmetry or the 
PQ symmetry below the U(1)an gauge boson mass scale: 
Vr > wre =/? and yr > Wre/2. This global PQ 
symmetry can be broken in the axion window as in the 
field theoretic axion models. However, this idea on the 
decay constant does not work necessarily because most 
fields, including those removed at the GUT scale, carry 
the U(1)an charge. 


2. Model-dependent arion 


In 4D, Byrn contains more pseudoscalars By; with i 
and j tangent to the compact space Vz. If they are ax- 
ions, these are MD axions. The number of massless Bij 
modes at the KK mass level is the second Betti num- 
ber of the compact space [isa Green, Schwarz, Wit- 
ten (1987), Eq. (14.3.10)], which was discussed in the 
early days in T Bz, B72, Witten (1984,1985), Choi, 
Kim (1985)]. The string propagation on M4 x Vz can be 
described by a suitable nonlinear sigma model. In this 
sigma model description, when a closed string wraps Vz 
topologically nontrivially then there are world-sheet in- 
stantons due to the map Sı — U(1). It was known that 
the world-sheet instantons are present precisely if the sec- 
ond Betti number is nonzero , Green, Schwarz, Wit- 
ten (1987)], and hence the MD axions are expected to re- 
ceive non-negligible masses non-perturbatively [al 


65, Wen, Witten (1986), Dine, Seiberg, Wen, Witten 
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1986,1987)], but it may be a model dependent statement 
Es Polchinski (2006)]. If a MD axion is known to have 
no potential term except the anomaly terms, then one 
should check the cz and c3 couplings to confirm that it is 
really an axion. There has been no example presented yet 
in this way for a MD axion. If a MD axion is present, its 
decay constant is expected to be near the string scale as 
explicitly given as Fyp = all? Mp [23/2 51/3 92/3 from 
the anomaly term alone in , Svrcek, Witten (2006)]. 
'The Green-Schwarz term integrated over Vz leads to this 
kind of the decay constant for the MD axion lod, Choi, 
Kim (1985)]. However, as commented above, one has to 
calculate the corresponding c» term also to pinpoint the 
MD axion decay constant Fyp. 


3. Toward a plausible QCD azion from string 


A key problem in string axion models is to find a 
method obtaining a QCD axion at the axion window 
(10? GeV < F, x 10? GeV) but an attractive model 
toward this direction is still lacking. Thus, the most 
pressing issues is the problem of introducing a detectable 
QCD axion from superstring. It includes the search for 
an approximate PQ symmetry toward a detectable QCD 
axion. 


'The conditions for compactified manifolds in warped 
space toward lowering the MI axion decay constant has 
been discussed in , Dasgupta, Firouzjahi, Gwyn], but 
its realization seems nontrivial. 


The idea of localizing MD axions at fixed points toward 
lowering the decay constant has been proposed in 23 I. 
W. Kim, J. E. Kim (2006)]. It uses the warp factor idea 
and one needs a so-called Giddings-Kachru-Polchinski 
throat [L72, Giddings, Kachru, Polchinski (2002)] in the 
Type-II string, but in the heterotic string a non-Kahler 
Vz is needed |33, Becker, Becker, Fu, Tseng, Yau (2006)]. 
Indeed the warp factor is obtained in this way, which 
however has the power law behavior. 


The intermediate scale string models can introduce the 
axion window as the ultra-violet completion scale | 
Burgess, Ibañez, Quevedo (1999)]. On the other hand, in 
this case the large radius to generate the Planck mass is 
the scale needing explanation. 


We note that an attractive method of obtaining Fa in 
the axion window is through the composite axion from 
superstring as discussed in Subsec. However, from 
string construction the composite axion has not been ob- 
tained so far. 


Even if one lowered some Fa, we must consider the 
hidden sector also in estimating the axion masses and 
decay constants as discussed below. 


39 


4. Hidden sector confining forces, axion mixing, and 
approximate PQ symmetry 


With the hidden-sector confining forces, we need at 
least two (QCD and one hidden-sector) 0s which have to 
be settled to zero and hence we need at least two axions. 
For definiteness, let us consider only one more confining 
force at an intermediate scale, which may be the source 
of gravity mediation or GMSB. In this case, at least one 
MD axion is assumed to be present, and the axion mixing 
must be considered. We assume that one decay constant 
is in the intermediate scale. Here, there is an important 
(almost) theorem: the cross theorem on decay constants 
and condensation scales. Suppose that there are two ax- 
ions a1 with F} and a» with F> (Fi < Fz) which couple to 
axion potentials with scales A, and Ag (A1 < Ag). The 
theorem states that 223, D31l R34 Kim (1999, 2000), I. 
W. Kim, J. E. Kim (2006): according to the diagonal- 
ization process in most cases with generic couplings, the 
larger potential scale A» chooses the smaller decay con- 
stant F}, and the smaller potential scale A; chooses the 
larger decay constant F>. So, just obtaining a small de- 
cay constant is not enough. The hidden sector may steal 
the smaller decay constant. Probably, the QCD axion 
is left with the larger decay constant. We can turn this 
around such that the hidden sector instanton potential is 
shallower than the QCD instanton potential since the in- 
stanton potential is proportional to the light quark mass 
as discussed in Subsec. [IIB] If a hidden-sector quark 
mass is extremely small, then the QCD axion can win 
the smaller decay constant and the other axion is an ex- 
tremely light axion which can be used to fit the observed 
dark energy Perlmutter et. al. (1998), 
Riess et. al. (1998), Komatsu et. al (WMAP Collabo- 
ration, 2008)] which is named as the quintessential axion 
Pal Kim, Nilles (2003)]. This can be easily realized if 
some hidden-sector squark condensations are very small 
as Fig. Z'7]can generate hidden-sector quark masses [23 
L-W. Kim, J. E. Kim (2006)]. 
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FIG. 27: The hidden-sector squark condensation breaks chiral 
symmetry and generate hidden-sector quark masses. 


Since it is difficult to obtain a reasonable light MD ax- 
ion, it has been tried to look for an approximate PQ sym- 
metry from string compactification. Only one reference 
exists from a realistic string compactification because of 
the difficulty of calculating all approximate PQ charges of 
quarks lo4, Choi, I.-W. Kim, J. E. Kim (2007)]. After all, 


the topologically attractive B;; may not be the QCD ax- 
ion we want. In this regard, we note that there already 
exists a field theoretic work regarding an approximate 
PQ symmetry, starting with a discrete Zg symmetry 
E Lazarides, Panagiotakopoulos, Shafi (1986)]. Later, 
the gravitational nonperturbative effects such as worm- 
holes and black holes were phenomenologically studied 
in view of any global symmetries [LZ iA [i74 Gid- 
dings, Strominger (1988), Lee (1988), Gilbert (1989)]. It 
is known that the PQ symmetry breaking operators in 
the superpotential "bud be forbidden up to dimension 8 
Ed Ld ld (od Bill Kamionkowski, March-Russell 
im Barr, Seckel (1992), Holman, Hsu, Kephart, 
Kolb, Watkins, Widrow (1992), Ghigna, Lusignoli, Ron- 
cadelli (1992), Dobrescu (1997)]. If we introduce an ap- 
proximate PQ symmetry, it is better to forbid the PQ 
symmetry breaking operators up to dimension 8 in the 
superpotential, possibly up to dimension 7 with reason- 
ably small couplings somewhere. 

In this spirit, it is worthwhile to check approximate 
PQ symmetries in string derived models. The MSSMs 
presented in 233, 234, 254 63d Bad £37, Kim, Kyae (2007), J. 
E. Kim, J.-H. Pad t Kyae (2007), Kim (2007a, 2007b)] 
satisfy most phenamenological constraints and one can 
check approximate global symmetries. But it is a tedious 
work, and so far only for the model of 31 Kim, Kyae 
(2007)] an approximate PQ symmetry has been checked 
out. In searching for an approximate global symmetry in 
a string derived model, there are so many Yukawa cou- 
plings to be considered that a complete study up to all 
orders is almost impossible. For example, in |94, Choi, 
I.-W. Kim, J. E. Kim (2007)] there appear O(104) d— 7 
superpotential terms and it is not a trivial task to find 
an approximate PQ symmetry direction, considering all 
these terms. Up to dimension 7 terms, there exists an 
approximate PQ symmetry which is spontaneously bro- 
ken. The resulting axion coupling with photon has been 
calculated in [94, Choi, I. p Kim, J. E. Kim (2007)] 
which is presented in Fig. [L6] to ethet with the CAST 
and Tokyo axion search een , 203, Andriamonje 
et. al. (2007), Inoue et. al. (2008)]. But the axion decay 
constant is not lowered. It is because the needed sin- 
glet VEVs, leading to the low energy MSSM, carry PQ 
charges. This is a generic problem for observable axions 
from superstring. Comparing this to the MI axion case 
with the anomalous U(1)an, it may be easier to realize the 
observable axion with an approximate PQ symmetry. 


VII. AXINO COSMOLOGY 


Supersymmetrization of axion models includes the 
fermionic superpartner axino à and the scalar superpart- 
ner saxion as discussed in Subsec. Both saxion and 
axino masses are split from the almost vanishing axion 
mass if SUSY is broken. The precise value of the ax- 
ino mass depends on the model, specified by the SUSY 
breaking sector and the mediation sector to the axion 
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supermultiplet ps3 Nilles (1984)]. Most probably, the 
saxion mass is around the soft mass scale Msysy. The 
axino mass should also be near this scale as well. But 
the axino "t3 can also be much smaller than Msusv 
fod, [i58 243, Frere, Gerard (1983), Kim, Masiero, 
Nanopoulos (1984), Chun, Kim, Nilles (1992a)] or much 
larger than Msusv m Chun, Lukes (1995)]. Therefore, 
we take the axino mass as a free parameter here. 

The decouplin 
plet is of order 


F, dich 
1 2 10H HE 2 
nas = 10% Gov (e e) (22) 


where a, is the QCD coupling constant. 


' temperature of the axino supermulti- 
, Rajagopal, Turner, Wilczek (1991)], 


(125) 


The saxion cosmology is a simple extension of the 
standard cosmology with saxion mass around the SUSY 
breaking scale | | 22d, J. E. Kim (1991), Chang, 
H. B. Kim (1996), Asaka, Yamaguchi (1999))], but 
its effect is not so dramatic as the effect of the ax- 
Ed cod bad | here we focus on the axino cosmology 

229, Bull Rajagopal, Turner, Wilczek (1991), 
m J. an Kim, Roszkowski (1999), Covi, H. B. Kim, J. 
E. Kim, Roszkowski (2001), Choi, J. E. Kim, Lee, Seto 
(2008)]. In the moduli stabilization scenario of Ref. 
the saxion VEV has been estimated in 
(2007)]. 


The axino cosmology depends crucially on the nature 
of R-parity. If R-parity is conserved and the axino is 
lighter than the neutralino, then most probably the axino 
or gravitino (in case of GMSB) is the LSP. If R-parity 
is not conserved, the neutralino can decay to ordinary 
SM particles, which has been discussed extensively in 
Ig, Allanach, Dedes, Dreiner (2004)] and references cited 
there. 


, 


, Choi, Jeong 


Now we focus on R-parity conservation. The neu- 
tralino, if it is the LSP, is a natural candidate for dark 
matter. Due to TeV scale sparticle interactions, the ther- 
mal history of neutralinos allows them to be dark mat- 
ter. But, imposing a solution of the strong CP problem 
via the axion, the thermal history involves contributions 
from the axion sector, notably by the axino. Since axino 
cosmology depends on neutralino and gravitino number 
densities, let us comment on the neutralino and gravitino 
cosmologies before discussing the effect of the axino. The 
neutralino cosmology depends on the neutralino freeze- 
out temterature [L48 , Lee, Weinberg (1977), Drees, 
Nojiri (1993)] and the gravitino cosmology depends on 
the reheating temperature after inflation , Weinberg 
(1982)]. Here we list several relevant temperatures in the 
axino cosmology 


Ta—dcep = axino decoupling temperature 
Tr = reheating temperature after inflation 
Tfr = neutralino freeze — out temperature (126) 
Ta=rad = axino — radiation equality temperature 


Tp — radiation temperature right after à decay 


Here, we are interested in the axino domination of the 
dark matter density. The cold axino dark matter possi- 
bility in the evolution history occurs either a heavy axino 
decayed already or axino has not decayed yet. If the ax- 
ino has not decayed yet, the current axino CDM can be 
estimated using Ta—acp or Tr. If it has decayed already, 
the past cold axino dark matter requires the existence of 
Tx at some earlier time 


4 : 
zmaYa(Ig") = Tp 


- (127) 


so that Ya(Tr) = na/s > Ya(Tg!") at the time of the 
reheating after inflation, where Tz" is the temperature 
above which axinos dominate the universe before they 
decay. 


A. Neutralino and gravitino 


The neutralino LSP seems the most attractive can- 
didate for CDM simply because the TeV order SUSY 
breaking scale introduces the LSP as a WIMP [54 Ezg 
Goldberg (1983), Ellis, Hagelin Nanopoulos, Olive, Sred- 
nicki (1984)]. The neutralino, which was in the thermal 
equilibrium in the early universe, decouples and freezes 
out when the annihilation rate becomes smaller than 
the Hubble parameter. The freeze-out temperature Tfr 
is normally given by m,/25 ; psa Lee, Weinberg 
(1977b), Kolb, Turner (1990)], e.g. 4GeV for 100 GeV 
neutralino. Obviously, the neutralino relic density is not 
affected by the axino for Tp > T, since neutralinos were 
in thermal equilibrium after the axino decay. This is the 
standard neutralino dark matter. With the axino intro- 
duction, therefore, we study the case Tp < Tfr- 

Gravitinos in the universe are important if they dom- 
inate the dark matter fraction now or they had affected 
the result of nucleosynthesis. Thermal gravitinos pro- 
duced at the Planckian time are important if m3/2 ~ 1 
keV psa Pagels, Primack (1982)]. However, in the in- 
flationary scenario these Planckian time gravitinos are 
not important now. For a heavy gravitino, it was ob- 
served that the gravitino decay affects the nucleosynthe- 
sis B63, Weinberg (1982)|, which was suggested to be 
solved by inflation bd Krauss (1983), Khlopov, 
Linde (1984)]. Then, the gravitino number density is 
roughly estimated in terms of the reheating temperature 
after inflation 13/5 x Tg. To estimate the cosmological 
bound on Tk rather accurately, a full supergravity inter- 
action , Cremmer, Ferrara, Girardello, van Próyen 
(1983)] has been used and applied to the dissociation 
problem of rare light elements such as deuterium, etc., 
resulting in Tg < 10? GeV [155, Ellis, Kim, Nanopou- 
los (1984)]. A recent calculation of Tg has been per- 
formed using the nucleosynthesis code to look for "Li de- 
struction and/or 9Li overproduction P17, Pod Kawasaki, 
Kohri, Moroi (2005, 2008)], following the earlier work 
of [Lid Cyburt, Ellis, Fields, Olive (2003)], which led 


Al 


to a stronger bound Tg < 108 GeV if the gravitino is 
lighter than the gluino and Tg < 10’ GeV if the grav- 
itino is heavier than the gluino. This gravitino prob- 
lem is absent if the gravitino is the next LSP (NLSP), 
ma < m3/5 < My, since a thermally produced gravitino 
would decay into an axino and an axion which would 
not affect the BBN produced light elements [d Asaka, 
Yanagida (2000)]. 

If the gravitino is the LSP with the stau or neutralino 
as the NLSP, the gravitino can be the CDM even in 
the constrained MSSM (or mSUGRA) for some param- 
eter space, avoiding the BBN and b — sy constraints 

Ed | (820, Boehm, Djouadi, Drees (2000), Ellis, 
Olive, Santoso, Spanos (2004), Roszkowski, Ruiz de Aus- 
tri, Choi (2005), Cerdeno, Choi, Jedamzik, Roszkowski, 
Ruiz de Austri (2006)]. 


B. Axino 


Thus, in SUSY theories we must consider a relatively 
small reheating temperature 10775 GeV. The axino cos- 
mology must also be considered with this low reheating 
temperature. 

In principle, the axion supermultiplet is independent 
from the observable sector in which case we may take 
the axino mass as a free parameter of from keV scale to 
a value much larger than the gravitino mass 
Chun, Kim, Nilles (1992a), Chun, Lukas (1995)]. Light 
axinos (ma < 100 GeV) can be a dark matter can- 
didate, and they have been studied extensively as a 
warm dark matter candidate Bul Rajagopal, Turner, 
Wilczek (1991)] with the reheating temperature given in 

, Brandenburg, Steffen (2004)], or a CDM candidate 

| fod, Eos B3d Covi, J. E. Kim, Roszkowski 
(1999), Covi, J. E. Kim, H. B. Kim, Roszkowski (2001), 
Roszkowski, Seto (2007), Seto, Yamaguchi (2007), Asaka, 
Yanagida (2000)]. Heavy axinos, however, cannot be the 
LSP and can decay to the LSP plus light particles. This 
heavy axino decay to neutralinos has been already con- 
sidered [104, Chun, Lukas (1995)]. The heavy axino pos- 
sibility was considered briefly in studying cosmological 
effects of the saxion by , Kawasaki, Nakayama, 
Senami (2007), Kawasaki, Nakayama (2008). A more 
complete cosmological analysis of the heavy axino has 
been discussed in [94 Choi, Kim, Lee, Seto (2008)]. 

Since the CDM fraction of the universe is roughly 0.23 
Komatsu et al. (WMAP Collaboration, 2008)], let 
us focus on the CDM possibility of the axino or axino 
related neutralino. 

For the axino to be the LSP, it must be lighter than 
the lightest neutralino and gravitino. In this case, we 
do not have Tp of Eq. (29). If the lightest neutralino 
is the NLSP, ma < my < maj», the thermal produc- 
tion (TP) mechanism gives the afore-mentioned bound 
on the reheating temperature after inflation. At the high 
reheating temperature, the TP is dominant in the axino 
production , Covi, J. E. Kim, Roszkowski (1999)]. 
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FIG. 28: The solid line is the upper bound from TP. The 
yellow region is the region where NTP can give cosmologically 
interesting results (QNT? h? ~ 1). The freezeout temperature 
is Tr BR >. 


If the reheating temperature is below the critical energy 
density line, there exists another axino CDM possibility 
from the nonthermally produced (NTP) axinos which re- 
sult from neutralino decay Covi, J. E. Kim, H. B. 
Kim, Roszkowski (2001)]. This situation is sketched in 
Fig. We note that with R-parity conservation, the 
double production of low mass axinos are negligible in su- 
pernovae, and hence from SN1987A there does not result 
a useful exclusion region in the low mass region. 

Since the final axino energy fraction is reduced by the 
mass ratio, Qah? = (ma/m,)0,h? for ma < my < 
m3/2, the stringent cosmologically constrained MSSM 
parameter space for m, can be expanded. As shown 
in this figure, the NTP axinos can be CDM for rel- 
atively low reheating temperature (« 10 TeV) for 10 
MeV < ma < my. In Fig. Dg] the thin dash line cor- 
ner on the RHS corresponds to the MSSM models with 
QA? < 10*, and a small axino mass renders the possibil- 
ity of the axino forming 2396 of the closure density. If all 
SUSY mass parameters are below 1 TeV, then probably 
Qh? < 100 (the thick solid-dash line corner on the RHS) 
but a sufficient axino energy density requires ma 1 
GeV. Thus, if the LHC does not detect the neutralino 
needed for its closing of the universe, the axino clos- 
ing is a possibility bà lod, fi) [13 Choi, Roszkowski, 
Ruiz de Austri (2008), Covi, Roszkowski, Small (2002), 
Covi, Roszkowski, Ruiz de Austri, Small (2004), Choi, 
Roszkowski (2006)]. If the NLSP is stau with the axino 
or gravitino LSP, the previously forbidden stau LSP re- 
gion is erased. In this case, the CDM axino is similar to 
the bino LSP case but it is easier to detect the stau signal 
due to the charged stau at the LHC EI, Brandenburg, 
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10? 108 104 


FIG. 29: The TR vs. ma plot for my = 100 GeV and Fy = 
10'* GeV. 


Covi, Hamaguchi, Roszkowski, Steffen (2004)]. However, 
the efforts to detect axinos may be difficult B23, H. B. 
Kim, J. E. Kim (2002)]. 

In the GMSB scenario, the gravitino mass is generally 
smaller than the neutralino mass and possibly smaller 
than the axino mass, for which case the cosmological ef- 
fect has been studied by my 21) Chun, H. B. Kim, J. 
E. Kim (1993), H. B. Kim, J. E. Kim (1995)]. 

For a heavy axino decaying to a neutralino, we present 
a Tg vs. ma plot for F, = 10!! GeV in Fig. The 
region Tg > Tz. acp is above the dashed blue line. The 
axino lifetime greater than 0.1 s is denoted by the red 
shaded region in the LHS. The blue shaded region in the 
RHS is where the axino decays before the neutralino de- 
couples (Tp > Tfr). The magenta lines (horizontal) are 
the contours of the entropy increase due to the axino 
decay, r = S;/So. Above r = 1 lines axinos dominate 
the universe before they decay. The green lines (verti- 
cal) denote the (c44,4,U,4;), Where Jann is the neutralino 
annihilation cross section, in units of GeV ? which are 
used to give the right amount of neutralino relic den- 
sity. In Fig. we use neutralino and gluino masses 
as my = 100 GeV and mg = 2 TeV, respectively. For a 
larger F, and a heavier neutralino mass, the green lines 
move to the right [97, Choi, Kim, Lee, Seto (2008)]. 
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Anomalous solutions to the strong CP problem 
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We present a new mechanism for solving the strong CP problem using a Z2 discrete symmetry and 
an anomalous U (1) symmetry. A Za symmetry is used so that two gauge groups have the same theta 
angle. An anomalous U(1) symmetry makes the difference between the two theta angles physical 
and the sum unphysical. Two models are presented where the anomalous symmetry manifests itself 
in the IR in different ways. In the first model there are massless bifundamental quarks, a solution 
reminiscent of the massless up quark solution. In the IR of this model, the 7’ boson relaxes the QCD 
theta angle to the difference between the two theta angles - in this case zero. In the second model, 
the anomalous U(1) symmetry is realized in the IR as a dynamically generated mass term that 
has exactly the phase needed to cancel the theta angle. Both of these models make the extremely 
concrete prediction that there exist new colored particles at the TeV scale. 


The smallness of QCD's 0 angle has been a mystery 
for many years. The physically observable angle is 


0 = 0 + arg detY, Ya (1) 


where 0 is the theta angle and Y,,,4 are the up and down 
type yukawas respectively. Measurements show that 0 
must be smaller than 10^!? [I]. This result is especially 
surprising considering that the CP violating phase in the 
CKM matrix is order one. As both CP violating phases 
have contributions from the yukawa matrices, it is sur- 
prising that one should be large while the other is so 
small. This difference is a fine tuning of ten orders of 
magnitude and begs a dynamical explanation. 

There are two broad categories of solutions to the 
strong CP problem. The first type are solutions based 
on the CP and P discrete symmetries. The solutions 
which use the CP symmetry start with a CP invariant 
theory and spontaneously break it in such a way that the 
theta angle vanishes at tree level while the CKM phase is 
large. The most well known of these types of theories is 
the Nelson-Barr mechanism [AB]. Other solutions based 
off of P involve doubling the matter content of the SM 
such that the opposite parity sector carries the opposite 
theta angle. Diagonal subgroups thus have non-zero 
CKM phases but vanishing 0. 

The second class of solutions are based off of anoma- 
lous symmetries. The idea behind these solutions is that 
in the UV there exists an anomalous symmetry which 
can be used to rotate away the theta angle and render it 
unphysical. These solutions are differentiated from each 
other by how the anomalous symmetry is realized in the 
IR. One popular IR realization of the anomalous sym- 
metry is the axion [H8]. In the axion solution to the 
strong CP problem, an anomalous symmetry is sponta- 
neously broken yielding a pseudo goldstone boson, the 
axion. QCD dynamics generate a potential for the ax- 
ion. At the minimum of the potential, the axion vev 
cancels 0. 

Another solution to the strong CP problem that uses 
an anomalous symmetry is the massless up quark solu- 
tion [9], which is currently disfavored by data [I0]. In 


the presence of a massless up quark, a chiral rotation of 
the up quark can remove 0 without changing any phys- 
ical parameters of the theory. 0 is thus an unobservable 
parameter. After confinement, there should be another 
dual description of how 0 is removed. This dual descrip- 
tion is accomplished by the 7’ boson. In the large N limit, 
the 7’ boson has a small mass and can be incorporated 
into the low energy effective field theory of the goldstone 
bosons. The low energy effective description for the y’ 
boson [II] is 


L = f2Tr0, EO Xt + af2Tr (mE + mix") (2) 
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where a is O(1), b is O(1/N) and m is the mass of the 
quarks. X is the non-linear sigma field describing the 
breaking of U(3); x U(3)g down to the diagonal, i.e. it 
contains the 7’ boson in addition to the usual pions. The 
n vev is stabilized around the theta angle. All additional 
higher dimensional operators are a function of 7 — 0 fr 
as required by the anomalous symmetry. Thus once 7/ is 
integrated out, the only place in the Lagrangian where 
0 appears is in the mass terms. If the mass is zero then 
the entire IR Lagrangian is independent of 0. This is the 
IR description of the massless up quark solution to the 
strong CP problem. 

In the absence of an up quark mass, the 7 boson has 
a shift symmetry that relaxes the 0 angle to 0. The dif- 
ference between the massless up quark solution and the 
axion solution is that the observed 7’ boson obeys 


Mn’, fy ad Agcp (3) 
while the unobserved axion typically has 


Tr Tq 


(4) 


Ma fa = faa ——— 
My + ma 
Both of these solutions are realized in the IR as scalars 
with a shift symmetry that renders theta unphysical. 

In this work, we consider a using a Ze discrete sym- 
metry in conjunction with an anomalous symmetry to 


solve the strong CP problem. The Zə discrete symmetry 
takes the Standard Model (SM) to another mirror copy 
of the SM!. Massless quarks are introduced in a manner 
such that there is an anomalous U(1) symmetry that re- 
sults in the sum of the two theta angles being unphysical 
while the difference is physical. The presence of a mir- 
ror copy of the SM allows for several unique methods to 
dynamically remove the massless quarks from the IR. 


We now present a simple theory which uses a Zə dis- 
crete symmetry and an anomalous symmetry to obtain a 
vanishing 0. We first start off with two copies of the SM 
related by a Za symmetry. For notational convenience, 
the mirror SM will have all of its fields and gauge groups 
primed. Because of the Zə symmetry, the two theories 
have equal theta angles and yukawa matrices. The Zə 
symmetry is spontaneously broken in such a way that 
the vev of the mirror Higgs field is much larger than the 
vev of our Higgs field. The mirror Higgs obtains a large 
“natural” vev while our Higgs obtains a small vev. The 
large ratio of the vevs is the hierarchy problem, which 
we do not address in this paper. How the Zo symmetry 
is spontaneously broken is unimportant for the solution 
to the strong CP problem but will become important in 
the context of higher dimensional operators as will be 
discussed later. 


After the two Higgses, H and H’, obtain their differ- 
ent vevs, the Zo symmetry is broken and the two theta 
angles are no longer required to be equal. RG flow from 
(H') to (H) generates a non-zero difference in the theta 
angles that is much smaller than the 107 !? experimen- 
tal bounds[13]. For all intents and purposes we can treat 
the two sectors as having identical theta angles. Any sub- 
tlety regarding the difference of the two theta angles can 
be removed completely by considering a supersymmetric 
model where the theta angle does not run. 


There are two simple ways in which an anomalous sym- 
metry can be added to the picture. The simplest way is 
to add a pair of massless bifundmentals Ypg and vg that 
are charged under SU(3), and SU(3)». It is simple to 
see that using the anomalous rotation of Yg and Yp that 
the two theta angles can be simultaneously set to zero. 
After H' obtains a large vev, all of the dual matter aside 
from Yg and Yp becomes massive. The SU(3). gauge 
group has only three flavors and confines. Due to chi- 
ral symmetry breaking, there is an octet of scalars. The 
octet of scalars is an octet under SU(3). and obtains a 
mass from gauge boson loops in much the same way that 
77 obtains a larger mass than 7° from loops involving 
the photon. These loops give the scalars a mass a factor 
of few below the mass of the p’ mesons of the SU(3)¢.. 


Like the case with massless quarks, what is important 
for the low energy dynamics is how the 7’ boson behaves 
and not the dynamics of any of the pseudo goldstone 


1 For a mirror world axion based solution to the strong CP problem 


see [12]. 


bosons. To the extent that N = 3 can be approximated 
by the large N limit, we can write an effective low energy 
theory of the n [LI] which is 


2 / 2 
ER "pg tue a. xo pen 
c= (0 T) FP : (n f, 8) deos (5) 


where the theta angles appear in a manner required by 
the anomalous U(1) symmetry. For simplicity's sake we 
do not include the non-linear sigma field X. Unlike the 
axion, the mass of the r is not set by the mass of the 
quarks but instead by the topological charge density [I4]. 


AN; E 
m2, =f |, e ASSI 6 
7 f2 d0? la —0,no quarks QCD ( ) 
where Ny = 3 is the number of flavors, fy ~ fr is the 
pion decay constant, and E is the vacuum energy density 
evaluated when 6 = ( and in the absence of quarks. 
From Eq.|5} we see that after integrating out 7’ that the 
SU (3). theta angle becomes 0 — 8 = —arg detY, Ya. 

In the IR, the SU(3). has a boundary condition at 
Agcp: where its @ is equal to zero. This solution is like 
the massless quark solution to the strong CP problem 
where the 7 dynamics are important. In contrast, in 
composite axion solutions to the strong CP problem it is 
the dynamics of a pseudo-goldstone boson that obtains a 
mass from the QCD scale and not the QCD' scale which 
is important. 

'There is a second way in which an anomalous symme- 
try can be added to the theory with two copies of the 
SM. We consider adding a complex scalar ¢, Ny vector- 
like flavors of fundamentals (v, Yo) for SU(3). and 


Ny vector-like flavors of fundamentals (Wå and Bo.) for 
SU(3)~. The flavors of quarks are exchanged under the 
Zə symmetry while ¢ is odd under the symmetry. We re- 
quire that the Lagrangian have an anomalous symmetry 
under which the flavors ~ have charge 1 and the scalar 
@ has charge -2. As before, this anomalous symmetry 
renders the sum of the two theta angles unphysical. 

After H' obtains a vev, all of the dual matter aside 
from V, Voi, the SU(3)« and the U(1)ga gauge 
groups are massive. The SU(3)» gauge group confines 
with 


i i0 i 
(bo6bg,;) ~ Abone i PNE (7) 


Because all of the matter content of the mirror has been 
integrated out, the theta angle of the SU(3)» gauge 
group is 6 =0+ argdetY,,Yq. Using a chiral rotation, 
we see that 0 appears in the quark condensate as shown 
in Eq.|7| Alternatively, one can note that the 7 boson is 
stabilized at (7') = 0 f... As mentioned before, even after 
the Zə symmetry breaking, 6 — 8 to very high accuracy. 

The most general potential consistent with the anoma- 
lous symmetry for the scalar ¢ and the massless quarks 


is 


Vo = yet (Uo ; vito.) (8) 
+ moo! + (609)? + c.c. 


where we have taken y real. After the VQ quarks con- 
dense, we see that o obtains a vev 


AP 7 
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($) =y 


We have made the simplifying assumption that A « 1 
and m > Agcp’. Using a phase rotation on ó we can 
see that the phase is exactly correct. This vev gives a 
mass ~ e~t? /Nr to the fermions Yh and Yoi After 
integrating out these quarks, we find that the SU(3). 
gauge group has 0 = —arg detY„ Ya. Thus the invariant 
theta angle of SU (3). is zero. The IR manifestation of the 
anomalous symmetry is a dynamically generated mass 
term that cancels the theta angle! 

In the previous calculation, we ignored the back re- 
action of the vev of ó on hbo.) by taking m to be 
large. To take into account back reaction, we use the 
parameterization 


(Udo ;) = Moore" 15i f(yó) (10) 


where f(y) is a real function that parameterizes the ef- 
fects of a non-zero quark mass on the fermion bilinear. If 
the quarks were massless, then we know that they con- 
fine so that f(0) ~ 1. On the other hand, if the mass of 
the quarks is larger than Agcp’, then the quarks do not 
confine and we have f(m 2 Agcp’) & 0. To find the vev 
of @, we need to solve the equations of motion which are 
approximately 


mg — ye A3 ope /Nr fF (yd) = 0 (11) 


From the various limits of f(y), we see that y(9) S 
AQcm so that the mass of v and VQ, i$ S Agom. 

We now consider the effect of higher dimensional oper- 
ators on these solutions to the strong CP problem. There 
are two types of operators which can cause an effect. The 
first kind are higher dimensional operator which break 
the anomalous symmetry used to make the sum of the 
theta angles unphysical. As we have learned from string 
theory UV completions of axion models, there are good 
reasons to expect that quantum gravity effects which 
break the anomalous symmetries are greatly suppressed. 
'The second more dangerous type of higher dimensional 
operators respect all symmetries. Dangerous operators 
include 


g? HH! 
327? 


~ AH -~ 
+ GG (12) 
Må Mj, 


If the pre-factor was not included, then if one used a 
chiral rotation to rotate this coupling into the yukawa 
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FIG. 1: The mass of the p’ meson in the mirror sector as a 
function of the mirror Higgs vev. The mass of the scalar color 
octet will be a factor of few below the mass of the p’. 


couplings, one would find that the operator is not sup- 
pressed by M7, times an order one constant but instead 
by an order 32z?/g? constant. Thus the pre-factor is 
needed if higher dimensional operators are to be sup- 
pressed by the planck scale and not by a parametrically 
lower scale. After the Zə symmetry is broken, this op- 
erator causes the difference between the theta angles to 
be non-vanishing. In order not to reintroduce the strong 
CP problem, this operator requires that the vev of H' be 
smaller than ~ 1014 GeV. 


Depending on how the Zə is broken, there may be di- 
mension 5 operators which impose stronger constraints. 
If the Z2 symmetry is broken by a real scalar 9 odd un- 
der the discrete symmetry [I5], then there is a dangerous 
dimension 5 operator 


g? 


3272 Myi 


(cG G'G) (13) 


If the only fine-tuning allowed in the theory is that 
needed to solve the hierarchy problem, then this oper- 
ator leads to the constraint that ® ~ H” be smaller than 
10° GeV, which results in new colored states with mass 
below 100 GeV and is experimentally ruled out. Thus 
we require that the Zə is broken in such a way that 
only dimension 6 operators can be written. The sim- 
plest way to avoid this problem is to instead introduce 
a scalar ® which is a doublet under a new SO(2) gauge 
symmetry and impose that it transform as ® > i un- 
der the discrete Za symmetry. In this way, the quartic 
?(H Ht — H' H") can be tuned against the mass term to 
result in a very light Higgs and no dangerous dimension 
5 operators can be written. 

'The phenomenological consequences of these solutions 
are tied to the strong coupling scale of the mirror sector. 
In the solution involving massless bifundamentals, the 
first signatures would be the color octet bosons. The 
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FIG. 2: Agcp’ as a function of the mass of the Ny flavors 
of quarks VO. The vev of the mirror Higgs is set to be 104 
GeV. The mass of the N; flavor of quarks is bounded by about 
3 TeV. The black, blue and red lines are N; — 1,5 and 10 
respectively. 


color octet obtains a mass 


m2, M 3asC2(adj) 2 


= " (14) 


at 1-loop which is about a factor of 4 below the mass of 
the p' gauge bosons. In Fig. [i] we plot the mass of the 
p’ meson (assuming that m,/Agcp = my/Aqgcp') asa 
function of the vev of the mirror Higgs. 

In the second solution involving a scalar and funda- 
mentals, the first signature would be NN; vector-like fun- 
damental quarks with mass S Agcp:. In Fig. [2] we plot 
Aocp: as a function of the mass of the Ny flavors of fun- 
damental quarks setting the vev of the mirror Higgs to 
1014 GeV. We see that there is an upper bound on the 
mass of these new quarks which is about 3 TeV. 

The constraint from higher dimensional operators on 
the vev of H' and the experimental bounds coming from 
the search for colored particles leaves open a very inter- 
esting region of parameter space. Due to higher dimen- 
sional operators, we have (H^) < 10!4 GeV. In the theory 
with a scalar octet, we find that the upper bound on the 
Higgs vev sets an upper bound of 8 TeV on the mass of 
the p’ meson. The mass of the scalar octet is then only 
a few TeV and potentially observable at the LHC. In the 
theory with additional flavors of quarks, the upper bound 
on the Higgs vev sets the quarks to be lighter than some 
order one number times 3 TeV. 
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Both of these simple models contain collider observ- 
able particles. Much like how the 7° decays into a pair of 
photons through the Wess-Zumino term in the chiral La- 
grangian, the color octet decays into a pair of gluons via 
the Wess-Zumino term. In the other model, the funda- 
mental fermions are stable because they are the lightest 
particles charged under a vector-like U(1). 

In both of these simple models, there is also a massless 
U(1)gw: gauge boson. Due to the structure of both the- 
ories, the kinetic mixing between U(1) zy and U(1)gw: 
is suppressed by many loop factors. As a result, this new 
gauge boson is not constrained by current experiments. 
Alternatively, the two Zə copies of the SM could share a 
U(1)y so that there is only one photon and no additional 
massless U(1)gy gauge boson. 

Both of these models require that there exist colored 
particles with mass in the TeV range. This connection 
to the LHC is extremely exciting as the LHC is probing 
this region of parameter space. The color octets give a 
4 jet signal with two resonances. This search is being 
done (e.g. see Ref. [I6]) and while bounds were not pre- 
sented for this particular model, a very rough estimate 
gives a bound of order 500 GeV or so. On the other hand, 
the fundamental fermions are collider stable colored par- 
ticles. While these fermions have cross sections smaller 
than gluinos, Ref. [I7] demonstrates that the bound on 
these types of particles is roughly a TeV or so. 

In this short note, we have demonstrated a new avenue 
for making 0 vanish. This approach uses a Z discrete 
symmetry and an anomalous symmetry. We have pre- 
sented two simple models that illustrates this new ap- 
proach to the problem. The key mechanism which allows 
this approach to work is the observation that confine- 
ment of the mirror SM allows for unique ways in which 
the massless quarks can be removed from the IR. They 
can be removed from the IR by confinement or by a dy- 
namically generated mass with exactly the phase needed 
to cancel ð. 
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ABSTRACT: We evaluate the efficiency of axion production from spatially random initial 
conditions in the axion field, so a network of axionic strings is present. For the first time, we 
perform numerical simulations which fully account for the large short-distance contributions 
to the axionic string tension, and the resulting dense network of high-tension axionic strings. 
We find nevertheless that the total axion production is somewhat less efficient than in the 
angle-averaged misalignment case. Combining our results with a recent determination of 
the hot QCD topological susceptibility [1], we find that if the axion makes up all of the 
dark matter, then the axion mass is Ma = 26.2 + 3.4 eV. 
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1 Introduction 


The QCD axion [2, 3] is a hypothetical particle, predicted in models which solve the 
QCD theta problem [4-6] via the Peccei-Quinn mechanism [7, 8]. In the simplest models 
(9, 10] it is the angular mode of a complex scalar, V2% = fe’ or 6, = Argy, which 
would be the Goldstone boson of a spontaneously broken U(1) symmetry, except that the 
symmetry is also anomalous (explicitly broken by QCD). Therefore QCD effects induce 
a small, temperature-dependent “tilt” in the potential, and therefore make the angular 
fluctuations massive, so the axion mass is ma # 0. The axion is a dark matter candidate 
[11-13] because early Universe dynamics generically generate large coherent oscillations in 
the axion field — essentially a Bose-Einstein condensate of axions at or near rest — which 
act as a pressureless fluid on scales longer than a few meters. 
In this paper we will predict the axion's mass, given the following hypotheses: 


1. the axion exists; 


2. PQ symmetry is restored either during or at some point after inflation, so that the ax- 


? 


ion field starts out “random,” meaning that its value at points out of post-inflationary 


causal contact are uncorrelated [14, 15]; 


3. The cosmological epoch where axions are produced - roughly, temperatures around 
1 GeV — follows standard FRW behavior with the expected standard-model matter 
content; 


4. the axion makes up 100% of the dark matter, so its current energy density is set by 
measurements of Qamh?; pam/s = 0.39 eV with s the entropy density [16]. 


These assumptions give rise to a rich dynamics, with a network of axionic cosmic strings 
[17] which collapses once the axion mass becomes large in units of the system age, mat > 1, 
through the action of axionic domain walls [18], leaving a final state with small-amplitude 
axionic fluctuations which evolve adiabatically thereafter. Under our assumptions, a mass 
prediction should be possible, because the model has one principal free parameter, the 
axion “decay constant" (the vacuum expectation value breaking the U(1) symmetry) fa 
(defined below in Eq. (2.1)). This parameter determines the axion mass, see Eq. (2.2). 
And given our other assumptions, it also sets the dark matter density. By computing the 
relation between f, and the dark matter density, we should then be able to predict f, and 
therefore Mma. Such a prediction is valuable because it informs experiment, and because if 
the axion is then discovered at this mass, it will clarify its role as the dark matter. 

In the next section, we lay out our methodology for relating fa to the axionic dark 
matter density. Section 3 presents our numerical results. We end with a discussion. A few 
technical issues and numerical tests are postponed to an appendix. But for the impatient 
reader, we present our main results here. While there have been numerous previous studies 
of this problem [21-30], ours is the first which includes the physics of the large tension 
associated with axionic strings. This large tension leads to a much higher density of 
strings which are more robust and survive longer than in previous simulations. However, 
this makes surprisingly little difference in the final axion number produced. Let us set 
as a baseline for axion production, the angle-averaged misalignment value of the axion 
density. This is the axion density value we would find if the axion field starts out uniform 
in space with value ĝa, averaged over 04 € [—7, m] (without the approximation, sometimes 
made, of replacing 1 — cos 0, — 02/2 in the potential)”. At a given fa value, we find the 
axion number density produced in the inhomogeneous case is actually smaller than the 
misalignment value, by a factor of about 0.78. Since pam increases with increasing fa, this 
inefficiency must be compensated by a larger value of fa, and hence a smaller value of Mma, 


than has generally been assumed; we find Ma ~ 26.2 + 3.4 ueV. We postpone discussion of 
this result and its errors to the conclusions. 


'There are two other relevant parameters. There is the number Na of minima around the U(1) circle, 
cos Argy — cos Na Arg. But if Na # 1 then the model predicts stable domain walls which are a 
cosmological disaster [19, 20]. Also there is the mass of the radial excitation in the complex scalar field, m. 
This must be heavy, and we find below that the results are quite weakly dependent on its exact value. 

Specifically, we write the conformal-time axion mass squared as m2 = t^^? /t}+4 (t, n, and t, defined 
in the next section) and evolve 0(t) according to d?0/d£? + (2/t)d0/dt = —m2 sin0 to a time t > 4t,. The 
axion number density at time t is nax = (m0? + 0? /m)/2. We average over starting values 0 € [—7, 7] and 
use the resulting max average to normalize the result of a string simulation at the same t/t. value. 


2 Methodology 


Our approach will be as follows. The Lagrangian for the axion field is? 


m2 


8fz 


The middle term is the symmetry breaking Lagrangian, and the last term is the “tilt” in 


2 
- L = g" 0, pd, + (2p A) - x(T) (1 — cos Oa) . (2.1) 


the potential due to QCD effects. This tilt gives rise to an axion mass of 


m2(T) = Pu mi(T-0) = d 


In a radiation dominated FRW universe, in comoving coordinates and conformal time, the 


(2.2) 


metric is guy = Puno and the temperature is T œ t~!, so 
m2 


8fz 


Here x(T(t)) is the topological susceptibility. Model calculations [31] and a recent lattice 


2 
— Vg Loont = C c Vg Vote (2e*e — f) + x(t) (1 — cos )) « (2.3) 


calculation [1] indicate that x(T is approximately power law between 1.5 GeV and 400 
MeV, which, we will see, is wider than the relevant temperature range we need. Therefore 
we will treat x(T) as a power law, x(T) x T7”, so t?x(t) = f2t"*?/t*^, with t, the 
natural scale where the susceptibility begins to influence the dynamics; txMa,conf (tx) = 1 
where Ma,cont = 4/t?x(t)/ fa is the conformal-time axion mass. In terms of physical time, 
t, is the moment when m4H = 1. In the following we will suppress the subscript and write 
Ma,conf = Ma, except in the discussion. That is, masses and times will always be expressed 
in conformal units. 

'To initialize the network, we choose an independent random phase at every lattice site. 
We then evolve the fields for an initial time under strong damping (Q4-29/t > + kyaxo/t 
for times t < tstart) to prepare a string network relatively close to the scaling network 
density. The length and strength of damping is chosen such that the string network will 
roughly match on to the scaling network density; we will also study the dependence on the 
initial conditions below. 

'The model has two sorts of metastable defects, strings and domain walls. A string is 
identified as a linear structure where 6, changes by 27 in circling the string. A domain 
wall is a surface on which 0, = a; each string has one such domain wall ending on it. 
The domain walls only become distinct structures once mat >> 1; the surface tension of an 
isolated domain wall is c = 84 f2, which grows rapidly with time. Therefore the domain 
walls straighten out and pull the strings together, annihilating both networks and leaving 
small fluctuations in the ĝa field. We evolve all fields until this dynamics is complete and 
there are no strings left. Then we count the axion abundance by extracting 0, and d0,/dt 
from the simulation and applying the method of [29] to determine the axion content. This 
determines the total density of axions from all sources — we make no attempt to distinguish 
which axions arise from strings, from walls, or from misalignment, as we do not believe 


3We use [-+++] metric convention, and standard complex-field normalization y = (pr + ipi)/V2. 


such a distinction can be made unambiguously. We express the axion number produced as 
a ratio to the angle-averaged misalignment value and we determine f, such that the dark 
matter abundance is correct.* 

The scale m for “radial” excitations in the y field is may be as large as m ~ fa ~ 
10! GeV and must be at least 10? GeV (see Subsection 3.4), while the relevant length scale 
is H at the QCD epoch (around 1 GeV temperature), which is of order 107 !5GeV. We 
handle this huge scale hierarchy by observing that the only important physics it gives rise 
tois very thin, high-tension axionic cosmic strings. Specifically, the string tension should 
be Tis ^ Ka f2 with « = In(m/H) € [50,70]. We address this physics via the technique we 
recently introduced [32]. Specifically, we add abelian-Higgs degrees of freedom which are 
massive away from string cores but which induce a large string tension. In the Appendix A 
we review the procedure and explain how we implement x(T)(1— cos ĝa) into this method. 
'The outcome is that the string tension is maintained by some extra, massive degrees of 
freedom, but the mass scale m for these degrees of freedom must be resolved by the lattice, 
ma, € 1 with a the lattice spacing. The correct physical limit involves this scale becoming 
heavy compared to the physics of IR fluctuations in the axion field. Since the correct 
physical picture arises when m is large, we will hold ma fixed, that is, we keep m fixed 
in lattice units, throughout a simulation. In our implementation, the extra degrees of 
freedom introduce one new parameter qı, which determines the value added to «; most 
of our results are labeled (q1,q2) = (4,3), which means that & has been increased by 50 
through the added degrees of freedom [32]. 


3 Numerical results 


In Appendix B we show that our results are in the large volume limit if we keep L/t, > 5, 
and we show that the axion number does not evolve after the string network is gone and 
can be measured as soon as no string is left, without concern that it will evolve further. 
Therefore box size and axion number measurement do not contribute to our error budget. 
Here we will instead focus on those effects which still do. 


3.1 Lattice spacing 


For axionic strings to evolve correctly, the string core must be resolved by our lattice 
spacing. We need to check that our lattice is fine enough, in the sense of ma the product 
of the heavy scale and the lattice spacing is sufficiently small. That said, numerical cost 


3, so we want the largest value which 


scales as (ma)~* and required RAM scales as (ma)^ 
we can get away with. To test the (ma) dependence, we fix all other parameters in terms 
of m, and we consider axion production at various ma values in Figure 1. Because we have 
used an improved action, the result should naively converge in the small ma limit with 


4 motivating the axis choice in the left-hand plot. However, 


corrections vanishing as (ma) 
the right-hand plot shows that the data fit better assuming (ma)? dependence. Indeed the 


X? for an (ma)? fit is about 1, while for an (ma)? fit, X? = 15. Therefore we will assume 


^We implement the misalignment case in the same code by turning off the scalar gradient terms. We 
also implemented misalignment in a simple dedicated code as a cross-check. 
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Figure 1. Dependence of the axion production rate on the lattice spacing a, expressed as a 
function of ma with all else held fixed in units of m. 


that the errors are quadratic in spacing, despite our improved action. The fit indicates an 
upwards correction between (ma) = 1 and the continuum limit (ma) = 0 of 10 + 196. In 
the rest of this study we will use ma = 1 and correct the final results upwards by 10%. 

It remains to explain why the axion production scales with (ma)? despite our improved 
action, which should give (ma)^ convergence if fields are smooth. We believe this occurs 
because a small fraction of string has a velocity close to 1, and therefore a large Lorentz 
contraction factor. If the (energy-weighted) fraction of string with velocity-squared v? > 
ve only vanishes linearly in the ve — 1 limit, then the fraction of string with 47? < € 
would then scale linearly in e. Such scaling is consistent with our measured string velocity 
distribution. It also makes sense from the string equations of motion. In flat space, with 
x = 0, and in the Nambu-Goto limit, labeling the string location as z;(c,t) with o an affine 
parameter along the string, one can make a gauge choice such that «;2/, = 0 and such that 
vata, /(1— £;4;) = 1. The equation of motion is then 


äi = a! (3.1) 


which is solved by [33-35] 


a! - a;(o+t) LA , (3.2) 
p= aila +t) ; Bi(c—t) (3.3) 
ailrjai(r) = 1 = bi(r)bi(r) vr. (3.4) 


Here a, 8 are backwards and forwards propagating waves which take values on the unit 
sphere. Even in curved space we may satisfy the gauge choice instantaneously. The relevant 
question for the distribution of string velocities is the distribution of angles between a and 
B, since v? = i? = (1 — a : 8)/2. The measure of o - B values is uniform in [-1, 1] because 
a, B take values on the unit sphere. While we do not expect the distribution of a- 8 values 
to be uniform in [-1, 1], neither do we have a reason why it should avoid a- 8 = —1, so the 
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Figure 2. Dependence of the final axion density on the initial string network density. We measure 
the starting network density as the ratio between the scaled density £ at time mt = 128 (t/t, = 0.32) 
and the scaled density at time mt — 1024, shortly before the walls start to influence the network 


evolution (t/t, = 2.56). 


probability distribution should not vanish at œ- 8 = —1, and therefore the fraction of string 
with y7? < e should indeed vanish linearly in small e. Consider e = (ma), corresponding 
to a gamma-factor of y > 1/ma and therefore a Lorentz contracted string thickness? of 
y-1(1/m) < a. Our hypothesis for string velocities then states that an O((ma)?) fraction 
of string should be Lorentz contracted to a thickness of less than 1 lattice spacing. Such 
string is mistreated regardless of how improved our update algorithm is. Therefore, even 
if typical string is treated correctly with O((ma)*) errors, the fraction which is mistreated 
is of order (ma)?. This allows (ma)? scaling corrections, regardless of the level of lattice 


action improvement. 


3.2 Initial network density 


We want the axion production from a string network which is initially in the scaling regime. 
But this cannot be exactly achieved; initial conditions will typically produce a network 
which is either denser or less dense than scaling. The network evolves towards scaling, and 
if mt, is large enough then initial conditions should have little effect. But it would still be 
good to check how sensitive the final axion number is to the starting conditions. 

We address this in Figure 2. We introduce the scaled network density 


p ene Val 
m string T (3.5) 


d i AVspace í 


°In this parametric argument we are neglecting order-1 factors which make the string somewhat thicker 


than 1/m and mean that, for ma = 1, most string is actually properly treated. 


with y the local gamma-factor of the string so that the integral represents the total invariant 
length of string (length scaled by a y factor to account for the energy content), and with 
Vepace the volume of the simulation. This combination should approach a fixed “scaling” 
value as £ increases (for x(T) = 0, that is, in the absence of potential tilt). We measure £ 
once early in an evolution and again later in the evolution, just before the walls start to 
influence the string network evolution. We perform several evolutions with initial conditions 
with more or less damping, leading to denser or rarer initial networks. The ratio of the 
starting to final £, £init/£iate, then indicates whether the network started too thin or too 
dense, and therefore from which side it is approaching the scaling solution. We find a 
roughly linear correlation between this starting density and axion production, with more 
axions arising from denser starting networks. However the dependence is quite weak. Based 
on the figure, we will try to use initial conditions with this € ratio close to 1, and we will 
assign a 596 systematic error based on incomplete network scaling. 


3.3 Thin-core limit 


Next we must consider the effects of finite mtą, meaning that the strings are of finite 
thickness. This is clearly an artifact because in the physical case there is a hierarchy of 
many orders of magnitude between string thickness and axion mass. We have incorporated 
the logarithmic sensitivity to this hierarchy by implementing auxiliary fields to give rise 
to the resulting high string tension. But there can still be effects suppressed by powers 
of 1/(mt,), probably starting at first order. In particular, strings may lose energy via 
the radiation of unphysical massive modes. We only expect such radiation from short 
length-scale structures on the strings, which should generally get smoothed out by axion 
emission so long as mt, > k and mg/m « 1. However, because ma grows as a large 
power of conformal time, ma œ t!^2, the latter condition may not be maintained, given 
the persistence of high-tension strings. And if the axion mass m, comes of order the 
heavy-mode mass m then one might expect that axion production is lost to heavy-mode 
production, and the simulation could result in an underestimate of axion production.? 
We can “fix” this problem by artificially capping the value of ma, so that rather than 
growing with time at all times, t?x(t) grows up to some value and then becomes constant. 
But this replaces one unphysical behavior with another, and it will introduce new artifacts. 
The axionic wall tension is proportional to ma. These walls cause the network to collapse, 
and limiting their tension artificially extends the life of the network. We show this effect 
in Figure 3. The figure shows how the total length of strings, rescaled as in Eq. (3.5), and 
a similar rescaled wall area (without y factor or the conventional factor of 4), 
2 
wall E t Jat wall dX (3.6) 
Space 
evolve with time under the influence of various choices for a maximal m, /m value. We see 
that the wall area starts to decline as the wall surface tension turns on around £ — 1.6t,, 


ĉIn previous work [29] we showed that, for a theory of a pure scalar field, the axionic domain walls 
spontaneously decompose as soon as (ma /m)? > 1/39. The added degrees of freedom in our string cores 
prevent this physics from occurring; the domain walls remain strongly metastable up to and past ma /m = 1. 
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Figure 3. Density of walls and of strings as a function of t/t, when the axion mass is artificially 
limited with 5 different limits. 


and later around t = 2.8t, the surface tension becomes large enough to influence the string 
network evolution, drawing together the strings and collapsing the network by t = 4t,. 
However, artificially limiting the axion mass slows down the collapse of the network; for 
the smallest value we considered, the last bits of string survive almost to t = 5t,. 

We also expect that the network with a maximal m,/m value will produce more axions 
than without such a cutoff." The reason is that, as m, increases, the energy stored in the 
string network becomes less and less useful for producing axions. While increasing Ma 
increases the energy in axion fluctuations and in domain walls in proportion to Mma, it does 
not change the energy in strings. Therefore, as Mma increases, the capacity for strings to 
make axions is diluted; since making an axion costs energy ma, an energy E can only 
produce E/mq axions. Limiting Ma turns off this dilution, allowing the string energy to 
produce more axions, and could therefore result in overproduction of axions. 

The issue should disappear if we can reach a large enough value of mt,. But it is useful 
to consider different cutoff values for m/m and take the large mt, limit for each. If the 
continuum limits are the same, then it lends credence to the belief that we have achieved 
the continuum limit. According to Figure 4, T'he difference between different cutoff choices 
falls below 1096 starting around mt, — 300 (third-from-leftmost points). On the two still- 
finer lattices, the choices (mq/m)?,,, = 0.5 and (m,/m)?,,, = 0.2 agree to within 396. So 


max 
these values can be close to the continuum limit. Note that the last point in the figure, 


"Something special happens if we choose a maximum value for m, which is very close to m /2. In this 
case, there is a resonant nonlinear mode-coupling process which converts mass m/2 axions into mass-m 
excitations, leading to a reduction in the axion production for values of (ma/m)max very close to 0.5. This 
effect is clearly an artifact, so we avoid this special value. 
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Figure 4. Axion production as a function of the continuum limit 1/mt, — 0 at three values of 
the artificial limiting value for the axion mass (m4 /m)iax. 


with mt, = 625, was achieved by loosening ma from ma = 1.0 to ma = 1.25 and using 
the result of Figure 1 to extrapolate it to the same value as the other points; it is also at 
a slightly smaller physical volume, Lt, — 4.1 rather than 5.1. 

'The error bars shown in the figure are statistical only. However the statistical errors 
for points with the same mt, value but different (m, /m)2,,, values are strongly correlated, 
since they are calculated from simulations which are identical up to the point when m, 
reaches the smaller upper-bound value. Therefore the determination of the difference be- 
tween different (ma/M)Ž ax choices has smaller errors. In particular this difference is not 
linear in 1/mt., but drops to a small value at a sufficient Ma axt. value. That complicates 
the continuum limit. Here we will perform a linear extrapolation of the three smallest 
1/mt, data points, each for (ma/m)?,,, = 0.2 and 0.5. We find 0.696(46) and 0.729(41) 
respectively. The fact that these answers are not the same indicates that our lattices are 
not yet abundantly fine. We assign a 1096 error bar for the continuum-extrapolated value, 
to include these systematic issues, adopting nax/Nmisalign = 0.71(7). This is for n = 7, 
k = 50 from extra degrees of freedom, and before performing the small ma extrapolation. 


3.4 String tension and temperature-dependent susceptibility 


Having discussed numerical artifacts, we now turn to actual physical parameters which are 
relevant but not completely known: the string tension « and the strength of the temperature 
dependence n in x(T) x T". 

The slope n is calculable in lattice QCD. Recently Borsanyi et al have presented [1] 
results up to and beyond the relevant temperature range. Using their results at 600 and 
1200 MeV, we estimate n = 7.6 + 0.5. Most groups find results which agree with Bor- 
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Figure 5. Dependence of the relative axion production efficiency on the parameter n, controlling 
how quickly m, rises with t. For small n (gradually rising ma) the network produces more axions 
than in misalignment; for large n it produces fewer. 


sanyi et al at lower temperatures [36-41], although no group has reproduced these higher 
temperatures and even below 600 MeV there are some results which appear discrepant 
(42, 43]. Therefore we will explore other n values but consider values near n = 7.5 to be 
likely correct. We also feel that we gain some physical insight by considering different n 
values, especially much smaller values. We do this in Figure 5. The figure shows that small 
n values lead to more axions than in the misalignment mechanism, while large n values 
lead to less. But between n — 7 and n — 8 the dependence is not very strong. Therefore 
our choice to use n = 7 elsewhere, which we made mostly for simplicity, does not appear 
to be very critical. 

We interpret the results of Figure 5 as follows. The larger the n-value, the more rapidly 
the axion mass Ma turns on, and therefore the heavier the axion is when the string network 
breaks up and loses its energy. That means that for small n, the network can still produce 
relatively many of the relatively-light axions, but for large n the axions quickly become 
heavy and the string energy cannot produce a large number of them. This is consistent with 
what we saw when (m4/m)2,4, was small. Indeed, the results at n = 2 had (m/m)? = 0.07 
at the time the string network had completely disappeared, so walls broke up and axion 
production occurred when axions were still relatively light. 

Finally we consider the & value. Above we define & as k = In(m/H). For us H = 1/t 
the inverse system age. Therefore the contribution from axionic modes to & is In(mt), which 
we approximate to its value at t = 3t, since this is when the string network is breaking 
up. In addition there is a contribution from the extra massive degrees of freedom we have 
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added, so our simulations have 
n = ln(3mt,) + 2(q7 + 2) , (3.7) 


where the charges (q1, q2) are explained in Appendix A. 

We do not know what the physical value of & should be, because we don't know the 
model-dependent microscopic origin of the axion field. In the single complex-field case 
[9, 10] we don't know the radial mass m; if the axion is a composite or arises from more 
complicated physics [44], we do not know the compositeness scale and whether there is an 
extra contribution to the string tension from the microscopic physics giving rise to the axion 
field. We can reasonably guess that m < fa ~ 2 x 101! GeV. Also the requirement that the 
radial excitations decay by the time the Universe reaches a temperature of 1 GeV, along 
with an estimate for their decay rate [45], Tm ~ Wem, sets very roughly m > 10? GeV. 
These limits correspond to approximately & € [48,67]. For mt, = 300 and q, = 4 the & 
value in the simulation is 50 + In(900) = 57, which is in this range. By considering other 
values of (q1,q2), we achieve « values larger and smaller than the physically interesting 


range. 


String density vs Tension 


Axion Production vs String Tension 
ri it 2 se ) “et ot mes PS yo 7T 


dis 0.7 H physical | 
4 E (299): | range | l 

Da4 E] F i i 
z T i i 
n E! : 
fæ E oesL ; 
o g "[ (5.4) 
p a d d 
Qn e [ : | 
B > oep | (43) 
5 2 -Bottom to top: E [ 
i «=16.8, 32.8, 56.8, 88.8 E 
E 9 ul | 
1 B.C l (3.2) mt,-300 ma-1.0 
pcd 1880 box: tens. 12 

Kaart=24 mt,,780 05 L LG), f ELE d E E. 

—— M i "0 20 40 60 80 100 
0 1 2 3 4 
Time t/t, « (string tension) 


Figure 6. Left: string density as a function of time for different « values. The higher the string 
tension, the longer the strings persist. Right: axion production efficiency as a function of &. 


We show results for the axion production as a function of & in Figure 6. The figure 
shows that higher tension strings give a significantly denser string network, with strings 
which break up later, but nevertheless produces only mildly more axions (note the false 
zero for the y-axis in the right plot). Therefore our ignorance of the physical value of m, K 
is not very significant in bracketing the physical value of the axion. A simple linear fit to 
the left frame in Figure 6, and the range we quoted above for &, gives a systematic error 
of +3% due to the unknown value of &. 

Note that the chosen initial conditions for both our n dependence and our & dependence 
studies produced somewhat underdense networks. One can see in the left frame of Figure 
6 that the underdensity is worse for the highest tensions, so the true «x dependence is 
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somewhat underestimated. Also note that the larger & values are farther from the large 
2 


max 


mt, limit, leading to a slight overestimate in the produced axions due to (m, /M)f ax effects. 


4 Discussion 


If we take the temperature dependence of the topological susceptibility to scale as x(T) cx 
T^" with n = 7.6 in the relevant temperature range [1], and assume that axionic string 


cores arise at a mass scale m ~ 10" GeV so the extra string tension is k = 58 + 10, then 
our results indicate an axion production efficiency which is 0.78(12) times as efficient as 
in the angle-averaged misalignment mechanism. The indicated error is dominated by the 
extrapolation to the large mt, limit, with the uncertainty due to « added linearly (not in 
quadrature). 

Now we use this result to calculate the axion mass. There has been relatively little 
entropy production since the Universe was 1 GeV in temperature, so the ratio of axion 
number-density to entropy density is approximately the same at the end of the axion- 
production epoch as it is now. We can express our results by saying that the axion number 
density, determined at t = 4t. or T = T./4 and then back-extrapolated to the temperature 
T. where m(T.) H(T.) = 1, was 


back-extrapolated nax(T = Tx) ~ KH(T,)f?, (4.1) 


where a numerical evaluation finds that the angle-averaged misalignment value for K is 
K = 16.61, and our result is K = 13.0+2.0. By dividing this by the entropy density at that 
temperature, s = 29,777T3/45, we get the modern axion-number to entropy ratio, which 
can be multiplied by the vacuum axion mass Ma = y X(T = 0)/ f, to give the modern dark 
matter density to entropy density ratio. We combine this with the Planck result |16], 


P* s 8.59 x 1071, 
S 


Pim _ Namh? mpny _ 0.1194 


-lly , 
; 5 ~ 6555; (938 MeV)(8.59 x 10717) ~ 0.39 eV, (4.2) 


thermal QCD results for the entropy density s and energy density £ of the thermal plasma 
from Borsanyi et al [1], 


2 155. 2r? T’ gy 
e(T)= =r, s(T)-7—,  &GeV)er73, (4.3) 
1 GeV 76 
x(T) = ( T ) (1.02(35) x 1071! GeV4), (4.4) 


Hubble's law H? = 87 e/(3m*,) with mp œ 1.22 x 101? GeV, the thermal value for the 
axion mass m2(T) = y(T)/f?, and vacuum value x(T' = 0) = (0.076 GeV)* [46], to obtain 


a 


fa = (2.21 € 0.29) x 101! GeV, (4.5) 
Ma = 26.2 + 3.4 peV, 
Ty = 1.54 + 0.05 GeV . 
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Taking the errors quoted in Ref.|1] at face value, the dominant error in fa and hence in m, 
is from our determination of K, while the error in 7, arises equally from the errors in K 
and in x(T). Eq. (4.5), Eq. (4.6), and Eq. (4.7) constitute the main results of our study. 

Our most striking result is that the axion production from random initial conditions, 
with the resulting dense and high-tension axionic string network, is actually smaller than 
the angle-averaged misalignment value. The deficit gets larger at large n, where the 04 
potential tilts more abruptly; if it tilts more gradually then the axion production exceeds 
the misalignment value. Furthermore, although axion production is larger from high- 
tension strings than from strings with a lower tension, the dependence is quite weak; a 
factor of 10 increase in string tension between our results and the results of [29], along with 
the resulting factor of 3 increase in the string network density, has led to less than a 30% 
increase in axion production. 

This clearly requires some explanation. The conventional wisdom has been (see for 
instance [28]) that axions are produced by misalignment in the space between walls, by 
walls, and by strings. Therefore the production is the sum of three terms, and must be 
larger than the misalignment contribution. We argue that this picture involves assumptions 
and commits double counting. It does not make sense to consider misalignment axions to 
be independent from walls. Within the misalignment mechanism, half of all axions emerge 
from the range of angles |04(t = 0)| € [2.76, 7]. But it is precisely the regions with 0, ~ 7 


? 


which become the domain walls. Much or most of the “misalignment” axion field energy 
becomes the domain walls; it is double counting to speak of both domain-wall axions 
and misalignment axions as independent contributions. Of course, since the axion field is 
initially very inhomogeneous, it is also not obvious that there are any spacetime regions 
where homogeneous misalignment is a useful description. 

Consider also what happens to the energy in domain walls. After the potential tilts and 
the domain walls become relatively thin and distinct, the wall surface tension induces forces 
on the strings. The walls lose their energy to accelerating the strings, which consumes the 
wall area (see Figure 3). Also in this epoch, it is not simple for walls or strings to emit 
axions. The axion frequency m, increases with time, and any process involving time scales 
longer than m;! has a frequency-mismatch problem to produce massive axions. That is, 
long wavelength fluctuations of walls or strings are incapable of producing axions because 
they drive the axion field at frequencies below ma. We saw this very clearly in our previous 
study of 2+1D axion production with massive strings [30]. 

What about the energy of the string network? The high string tension means that the 
network stores much more energy. But after the time scale t,, the energy in domain walls 
and in axionic fluctuations increases with the axion mass as E c m, « px while the 
string energy does not increase as Ma increases. Therefore the string network's ability to 
produce axions dilutes with time. The network only annihilates when the walls are able to 
influence string dynamics, which occurs when the wall energy is comparable to the string 
energy. That is, the strings only fragment when their energy is comparable to the energy 
that was present in the wall network which caused them to fragment. And there is still the 
question of how efficiently the resulting small loops turn their energy into axions. 

To improve this analysis, we see a few directions which need to be pursued. First, we 
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need simulations with more RAM, so that larger boxes, and therefore larger mt, values, 
can be studied. We need to be more systematic in setting the initial network density and 
understanding the approach to network scaling. It should be straightforward to reduce 
statistical and extrapolation errors to the few percent level, with the dedication of more 
computer power. 

Also, we would like to investigate some of the late network evolution in more micro- 
scopic detail. The string network breaks up into loops which then annihilate in a way which 
somehow does not produce many axions. It should be possible to cut such loops out of a 
simulation and resolve them with a much finer lattice, which can then properly separate 
the Ma and m mass scales and follow the loop dynamics down to short scales. This could 
help explain why so few axions are produced (or determine whether our limited lattice 
spacing is causing a systematic neglect of some relevant but shorter-distance physics). 
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A  Algorithmic details 


Here we explain in more detail how our numerics work. Following Ref. [32], we embed the 
axion field as a global U(1) symmetry of a theory with two scalars y1, Y2 and one U(1) 
gauge symmetry, so one linear combination of the U(1)xU(1) symmetry is gauged and one 
is global. Both are spontaneously broken by the scalar vacuum values: 


1 2 . 2 
—L (v1, 2, Ay) = gat FU + [c = inA)e| xs |i = ig Ay)2| 
2 2 2 2 
ui * 2 ma * 2 
+ 8v2 (2210 = v?) + 802 (22542 = v3) : (A.1) 


Here q1, q2 are the field charges with qo = qı — 1. The axion is the angle 
04 = g2 Arg 1 — qı ATE p2 (A.2) 


which is gauge-invariant. This procedure exactly reproduces the global symmetry and the 
way strings act as a source for the axion field; the dynamics are modified only by a large 
induced string tension, & ~ 2(q?+q3), and heavy degrees of freedom which should decouple 
from the dynamics in the continuum limit (in the sense of mt >> 1, or mt, >> 1 for our 


current purposes) [32]. We consider v; = v2 and m? = m2 = e?(q2v + q2v2) = m? so all 


heavy fields have a common mass. The axion decay constant is f? = v2v2/(v2q? + v2q2). 
In our lattice units we normalize our fields such that vı = 1 = ve. The topological 


susceptibility part of the potential (which breaks the global U(1) symmetry, “tilting” the 
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potential for the axion field), is implemented as 


t?(t) (1 —cos Arg 6.) — PIE F (29391) F (29592) (1 — cos (o Arg yı — qı Arg p2) ) 
(A.3) 
25,,(8 _ 4 
nos BG r), du. (A.4) 
1 3 CS 5: 


The function F (r) is inserted to soften the behavior of the susceptibility term in string cores; 
without this term the introduced potential becomes violently nondifferentiable wherever 
y* ~ 0 for either field, which causes problems for space-discretized equations of motion. 
'The modification only changes the tilted potential inside string cores, where its effect is very 
subdominant to the leading potential terms. But without this modification we do not get 
consistently stable evolution near string cores. Our results are insensitive to the specific 
form of F(r), provided F(1) = 1, F'(1) = 0, F(0) = 0, and F'(r) is continuous, which 
motivated our choice. A similar modification is common in single-scalar simulations of 
axionic strings, where most authors [19, 28] have made the substitution (1 — cos Arg p) > 
V2 Re o, a substitution which is correct only for the angular dynamics and only where 
2q*q = 1. This replacement is justified because it is simpler, is nonsingular at 0, and is 
only of much influence outside string cores, where it is nearly equivalent to the correct 
form. We have explicitly checked that in the single-scalar model, axion production and 
string dynamics are nearly indistinguishable whether we use (1— cos Arg y) or V2 Re y as 
the “tilt” in the potential. 

Our numerical implementation uses a standard leapfrog algorithm and the noncompact 


? improved action for both 


formulation of U(1). The only novel feature is that we use an a 
the scalar and gauge parts, which requires a somewhat nontrivial treatment of electric fields 
in which the link's canonical momentum is not the same as the link’s time derivative [47]. 


More details and tests are in [32]. 


B Other numerical tests 


Here we detail some tests which have little bearing on the extrapolation to a final result, 
and which we have therefore not put in the main development. 

In the main text we spend some effort considering when to stop the growth of x(T). 
But we do not discuss when to measure the axion number, arguing only that it is sufficient 
to measure after the string network is gone and only small fluctuations remain. Here we 
justify this claim. Figure 7 shows what happens when we measure the axion number before 
the string network has finished collapsing. The figure shows the density of strings in blue, 
and the density of axions, as measured at the indicated time, in black. This measurement 
is somewhat ambiguous because it involves identifying the axion angle 04 € [—7,7] which 
is discontinuous across domain walls. Such a discontinuity leads to “ringing” in the Fourier 
spectrum and formally gives a logarithmically UV divergent particle number (cut off by 
lattice effects). We “fix” this problem by truncating the largest ĝa values, reflecting 6, € 
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Early-measured axion number 
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Figure 7. Blue: scaled string density £ as a function of time. Black: instantaneously measured 
axion number at the same time. Dashed lines are for an evolution with an upper cutoff on x(T) at 
x(T) < m?/5. 


[1/2,7] to 04 — T — 04, and similarly for 6, € [—7, —7/2]. Despite this “cap” on the 
maximum size of |04| < 7/2, we nevertheless find a very large axion density if we measure 
axions before the network has decayed. However, we see that after the strings are gone, 
the axion number becomes completely independent of further time evolution. Our “cap” 
on large 04 values has no effect on this final plateau, because |04| > 7c/2 virtually never 
happens and represents a tiny fraction of the axion number. In light of this result, we 
generally measure Nax as soon as no strings remain, but when we evolve for longer and 
remeasure later, we get an answer which agrees at the 196 level. If we repeat this analysis 
for the misalignment scenario, we find that instead of becoming virtually t-independent 
at t = 4t,, the axion number becomes virtually t-independent already by t = 2t,. This 
difference reflects the absence of topological structures in the misalignment scenario. 

The other test which proves to play almost no role in the final axion density is the box 
volume. To test out to very large and quite small volumes, we used the rather small value 
of mt, = 200. Keeping everything else besides the volume unchanged, we find in Figure 8 
that the volume has less than a < 2% effect on the axion abundance down to a box length 
of Lt, = 2. Note that any box larger than Lt, = 8 should have essentially zero volume 
dependence, since the box periodicity is invisible for Lt > 2 and the axion number becomes 
an adiabatic invariant by t = 4t,. 

The smallest volume shown, L = 1.28t,, shows a larger generated axion number, with 
larger statistical fluctuations (we used more simulations for smaller volumes so the product 
of simulations and volumes is about the same for each data point). The reason is that, in 
a small fraction of small-L simulations, after all strings annihilate, there remains a domain 
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Volume dependence of axion production 
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Figure 8. Study of box volume dependence of the axion production rate. Except for the smallest 
volume, it appears the result shows extremely weak volume dependence. 


wall stretching across the whole box. This domain wall is metastable and lasts indefinitely, 
until it dominates the axion number. This is purely a small volume artifact; nothing of the 
sort ever occurs for the larger volumes. 

Because the volume dependence is so mild, it should be possible to study the axion 
production in boxes down to Lt, = 2 or 3. However, to be conservative, we have generally 
tried to keep Lt, > 5. Recall that there cannot be lattice volume dependence for L > 2t, 


and we need t = 4t,, so the box size dependence should be exactly zero for Lt, > 8. The 
extremely weak box size sensitivity allows us to relax this value somewhat. 
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ABSTRACT: We revisit the U(1)4 anomaly in the holographic model of low-energy QCD 
by Witten, Sakai, and Sugimoto, presenting a new and direct derivation of the Witten- 
Veneziano mechanism for generating the mass of the 7’ through an anomalous mixing of 
the Ramond-Ramond C4 field with the singlet component of the pseudoscalar mesons. 
The latter turns out to have a kinetic mixing with the normalizable modes of the Cj 
field representing pseudoscalar glueballs, yielding additional vertices for their production 
and their decay that dominate over those of the unmixed case considered previously in 
the Witten-Sakai-Sugimoto model. The leading channel is predicted to be decay into two 
vector mesons, followed in importance by decay into three pseudoscalar mesons. The issue 
of production of pseudoscalar glueballs in radiative J/y decays and in double diffractive 
processes is also discussed briefly. 
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1 Introduction 


In QCD, the U(1),4 part of the tree-level flavor symmetry U(Ny;)r x U(Ny)n is broken by 
the axial anomaly, leading to only N2— 1 pseudoscalar Goldstone bosons in the spontaneous 
breaking of U(1)y x SU(N); x SU(Ny)n — U(Ny)v with masses determined by the Gell- 
Mann-Oakes-Renner relation, while one isoscalar pseudoscalar boson, the r/', is found to be 
too heavy to be a Goldstone boson [1]. Its mass is determined by nonperturbative effects 
involving the axial anomaly, and it was suggested by ’t Hooft [2, 3] that these are due to 
instantons. However, in the limit of large number of colors Ne the effects of a dilute gas of 
instantons is exponentially suppressed. A different mechanism was proposed by Veneziano 
[4] and Witten [5], who showed that the nontrivial 0-dependence of large-N. pure Yang- 
Mills theory implies a mass term for the singlet 79 according to me = 2NFXq/ f2, where Xg 
is the topological susceptibility of pure Yang-Mills theory. 

In [6, 7| it was shown that this mechanism is indeed realized in the AdS/CFT frame- 
work. In the Witten model [8] of low-energy QCD, which is based on a supersymmetry- 
breaking circle compactification of Ne D4 branes in type-IIA supergravity, this was initially 
discussed in a model with flavor D6 branes |7] and subsequently also in the chiral Witten- 
Sakai-Sugimoto (WSS) model [9, 10], which is based on D8 branes and which realizes a 
fully nonabelian chiral symmetry breaking. 

A more detailed discussion of the Witten-Veneziano mechanism in the WSS model was 
recently given by Bartolini et al. in [11]. In the present paper we present an alternative, 
more direct derivation, which moreover allows us to reanalyse the possible mixing of the 
singlet no with the pseudoscalar glueball G. In the bottom-up V-QCD model of ref. [12], 


where the Veneziano limit of Ne — oo with Ny/N¢ fixed is employed, such mixing in the 
context of the Witten-Veneziano relation has been discussed in [13]. In the WSS model, one 
of us with F. Brünner has found [14] a vanishing mass mixing of no and G which led to the 
conclusion of a very narrow pseudoscalar state, because to leading order in the WSS model 
only vertices involving jointly the scalar glueball and ro appeared to be present. Our new 
derivation reveals the presence of a kinetic (derivative) mixing which leads to additional 
decay modes of pseudoscalar glueballs that are dominating those considered in Ref. [14] or 
in the phenomenological model of Ref. [15]. 

The importance of gluonic contributions for the physics of 7’ mesons is already evi- 
denced by the Witten-Veneziano mechanism. It has also been emphasized by the analysis 
of ref. [16], who have however left open the question whether this involves a coupling to 
physical pseudoscalar glueball excitations (see also |17-19]). Possible pseudoscalar glueball- 
meson mixing scenarios have been discussed widely in the literature [20-25], but exclusively 
in the form of mass mixings. The mixing scenario that we obtain in the WSS model leads 
to a dominance of the couplings that are induced by the Chern-Simons term, in particular 
to two vector mesons. Inclusion of quark masses leads to additional couplings such as to 
three pseudoscalar mesons. 

Assuming that the couplings obtained in the WSS model are more reliable than the 
results for the glueball masses, which for tensor and pseudoscalar glueballs are much lower 
than those predicted by lattice QCD, we consider a range of pseudoscalar glueball masses, 
with the result that a pseudoscalar glueball of around 2.6 GeV as indicated by (quenched) 
lattice QCD is found to be a rather broad resonance instead of a narrow state, which may 
hinder its experimental identification. On the other hand, its relatively strong coupling to 
vector mesons should enhance its production cross section in central exclusive production 
as well as in radiative J/w decays. 

In sec. 2 we review the WSS model, in particular the role of the Ramond-Ramond Cj 
which determines the 0 parameter of the dual theory, and how the U(1)4 anomaly and 
the Witten-Veneziano mechanism is realized. In sec. 3 we derive the mixing of ng and the 
pseudoscalar glueball modes contained in the normalizable C1 modes, and we work out its 
consequences for glueball-meson vertices and decay patterns of the pseudoscalar glueball in 
sec. 4, followed by a discussion and an outlook in sec. 5. 


2 Witten-Sakai-Sugimoto model and Witten-Veneziano mechanism 


2.1 The Witten model of low-energy QCD 


The WSS model is based on the Witten model of low-energy QCD [8] provided by the 
near-horizon geometry of a large number (Ne) of coincident D4 branes in type-IIA su- 
perstring theory wrapped on a circle of circumference Ry = Qn Mice with anti-periodic 
boundary conditions for fermions. Since gauginos are massive at tree-level and adjoint 
scalars acquire masses through loops, the dual theory at energies much smaller than Mxx 
is pure non-supersymmetric Yang-Mills theory at large 't Hooft coupling A = Nga. In 
the supergravity approximation, where the Kaluza-Klein mass scale My is kept finite, the 


background can be obtained from the dimensional reduction of an 11-dimensional doubly 
Wick-rotated black-hole geometry in AdS7 x S^, 


r? y L? dr? L? r$ 
I2 [f(r)dxy RE mde’ da jr dri] r2 f(r) 4 dO, f(r) Sh a (2.1) 


where u,v = 0,...,3, n, = diag(—1,1,1,1), together with a nonzero 4-form field strength 


2. ines 
dst, = 


F4 = 3(L/2)3w4, where w4 is the volume form of a unit 4-sphere with volume V4 = 8717/3. 
A regular Euclidean black hole is produced by identifying 7 = x4 c x4 + 27Ry, with 
R4-— Mig = L?/(3rxx). 

Dimensional reduction from 11-dimensional supergravity [26] with «7, = (27lp)?/(47) 
and lp = ge through 711, œ z41 + 27 R11, with R41 = gsls, and 


és, = G 4 gd dz” = e’? guyd dz + et? (del! + A rd My? (2.2) 


where e? = (r/L)?/? and hatted (unhatted) indices refer to 11 (10) dimensions, leads to 
type-ILA supergravity with string-frame action 


S —Sws + SR + Scs, (2.3) 


where 


1 1 
Sns ==> | dz/—ge ?? (n -40y 90M — 5 asa) , 


267o 
1 1 
SR=-— he. d'°x/—g (ior F Fal”) ; |j = ue pe 
Kio p- 
1 
Scs = — ad. Bə ^ F4 A F4. (2.4) 
Kio 


In order to have a standard form of D-brane actions with prefactors up = (22)-91; ? = 


for both the DBI and CS parts, we absorb the factor g? originally contained in Hi = 
2&7, / (21 R11) by rescaling F24 = gs 39 and e~® = g,e-?194, upon which we drop the 10d 
labels on the fields and redefine 2425 = (27/15. 

In the coordinates used in [9, 10], 


r? rê U3 
U=—, K(Z)=14+277=— =— 2.5 
T KT gp (25) 
the 10-dimensional metric reads 
p x Rp4 M? [ qu? 
ds, = (=) ,dz" da" + f(U)da2] + (==) | + uža] 2.6 
10 Rpa [n f ( ) 1 U KU) 4 ( ) 


with f(U) = 1— (Ukk/U)? and Rp4 = L/2; the dilaton and 4-form field strength are given 
by 

e? = g,(U/ R4) 4 (2.7) 
and is 
Di y, 


s 


Pres (2.8) 


(Here and in the following we stick to the usual normalization of the Ramond-Ramond 
fields in the string-theory literature [26]. In [9, 10] these fields are rescaled according to 
Cp = (21)? !1$C55.) The 4-form field strength is related to the number Ne of D4 branes 
by 


(27)713 f Fy = 27 N,, (2.9) 
S4 


which implies R3, = (L/2)? = ng, Nel3. 
The parameters! of the dual boundary theory which upon dimensional reduction through 
the circle S. with radius My becomes pure 3+1-dimensional Yang-Mills theory 


1 0 
L = ———Tr Fy? + — Tr Fym A Fy, 2.10 
29] |Fvw| t gà yM ^ FYM (2.10) 


can be identified from the UV limit U — oo of the D4 brane action 
1 
got = —p4Tr [ 47. /—9(5)(1 + 5 Qro IF? E J 
+u (2ra)? / Ci ^ Fym ^ Fym + ..., (2.11) 
with u4 = (27)~41>° and a! = l2. This gives 


Jm = 21g.l,Mkk, 0-4 2nk = > Cı (2.12) 
s JS, 
with k an integer. 

The Witten background reviewed above has a vanishing one-form field C- and thus 
corresponds to the theory with vanishing 0 parameter, or sufficiently small 0/N, such that 
the backreaction on the background can be neglected, in the k = 0 branch. (The fully 
backreacted case has been worked out in Ref. [28].) 

The holographic dictionary thus relates non-normalizable fluctuations of C to a local, 
possibly z-dependent 0 parameter. Normalizable modes are instead interpreted as pseu- 
doscalar (JPC = 07+) glueball excitations [29]. 

The relevant quadratic action for these fields is 


1 
SRD- zx | 47s |F|? 
Ai ( ) 


2r RAVa J ji F rL r7 2 
mE d dr | —— 1" "0,C,9,C; + — (0,C; 2.13 
641 i TKK : f(r)" ? i L’ l ) l ) 
The resulting field equations have the non-normalizable solution 
l 
(uu, 09 = 
Cy On Ra f(r)0(a) for O V(x) = 0. (2.14) 


In the Witten model of pure Yang-Mills theory, 0 = 0, but since this connection will be 
modified when quarks are included, we have providently introduced a different symbol. 


‘Following the notation of [9, 10, 27], gem differs by a factor two from the usual particle physics con- 
vention so that a, = g%y,/(27). 


Inserting (2.14) in the action gives 


3,74 
Xg 4,, 02 A Mk 
Sp=—-= [| d'x0 ———M 2.15 
R 2 y X , Xg 4(37)6 ( ) 
with Xg being the topological susceptibility. 
The normalizable solutions will be expanded in mass eigenfunctions, with radial eigen- 
value equations 


al aon mon (2.16) 
"Lp T flr) GT , ] 

subject to boundary conditions CO (r = oo) = CO = rkx) = 0, ðP (r = rkx) Æ 0. 

This determines the mass of the lightest pseudoscalar glueball as Mg = 1.885... x Mpk. 


2.2 Inclusion of quarks and U(1)4 anomaly 


Sakai and Sugimoto have extended the Witten model by introducing left and right handed 
chiral quarks through Ny pairs of D8 and D8 probe branes localized at separate points on 
the circle S, at the holographic boundary. Chiral symmetry breaking U(Ny)r x U(Ny) rR > 
U(Ny)r.-R emerges from the fact that the D8 and D8 branes have to join in the cigar-shaped 
background geometry. Choosing antipodal points on S$, leads to embedding functions with 
constant z4 = T and a joining of the branes at the tip of the cigar at rgg (or Ugg). In this 
case one can extend the coordinate Z introduced in (2.5) to the range —oo...oo in order 
to cover the radial extent of joined D8 and D8 branes. 


The action for these flavor branes is given by SP5 = EDS. + SP3 with 


D 
S hi = — Hg I. 


e-t T- det (g9) + 2ra F + B2), (2.17) 


where Tr denotes symmetrized trace, ug = (27-81; 9). and F =dA+ .A ^ .A is the field 
strength tensor for the nonabelian flavor gauge fields living on the D8-branes. (Bə is the 
bulk Kalb-Ramond 2-form field; its normalizable modes contain the pseudovector (1*7) 
glueballs of the dual gauge theory |29, 30].) 

The even and odd radial mode functions of .A,, are associated with the towers of vector 
and axial-vector meson fields, with the lowest mass eigenvalue (m2, ~ 0.660 Mg.) being 
identified as the p meson mass such that Mpg = 949 MeV. 

'The massless pseudoscalar Goldstone bosons are described by 


oo 
U(x)=P expi (/ dZAz(Z,2)) = exp (II^ A^ / fr), (2.18) 
—Ooo0 

where f2 = six AN Mx and A^ are Gell-Mann matrices supplemented by A = (N5/2)-1/21. 
Setting f, = 92.4MeV fixes A ~ 16.63 for Ne = 3. A smaller value of about 12.55 would be 
found by matching instead the large-N lattice result [31] for the string tension. As in [32] 
we shall consider the range A = 16.63...12.55 in order to obtain a theoretical error band 
for our quantitative predictions. 


The 9-dimensional Chern-Simons term of the flavor brane is given by 
SDE —us x v A(R) Tr exp (219! .F + B) ^ Cs, (2.19) 
q D8 


where the so-called A-roof genus factor involves A(R) = 1+ iu ER AR-+... with 
RMN = ilk LMNdzK* dx’. The term involving C3 contains the Wess-Zumino-Witten term 
of the dual gauge theory because F4 = dC3 is nonzero in the background [9, 33]. 

The Chern-Simons term is also involved in the U(1)4 anomaly of the dual gauge theory, 
because the C7 term modifies the equations of motion of the C4 field which is responsible 
for the 0 parameter. Using Hodge duality, dC7 = Fs = «F5, and integrating by parts, one 


finds that 


Scs D sino! | Tr (F2 ^ C7) 
D8 


= sso! | Tr (A1) ^ P5 A wr 
= ug2Ta" jJ da? 4/|gio| [5 (7) + 6 (r — «)] 
x (Tr (A;) 9" 9 0-C, + g"" g^" Tr (Au) 0C) , (2.20) 


where wr = |ô (T) + ô (T — 1)| dr has been introduced to extend the integration to the entire 
bulk spacetime. Thus the flavor probe branes induce a mixing term of the Abelian part of 
the flavor gauge field A:= N; `Tr A living on the D8 brane with the field Cz. The linear 
equations of motion for C; get an additional term localized on the D8 brane, 


1 r7 
323, 0 (8c) = = 2no' ugð, (A/—99"" g^ Tr (Ar)) (8 (7) +8 (T — v) 


= 2mo/ ugÓ,. (zm vy Ar] (6 (7) -ó(r — 7)), (2.21) 


(8) 


which we solve by introducing the localized fluctuation C7 ^ according to 


3 CÈ) = Ana! kous Ne À, (8 (T) + 6(7 — m)), (2.22) 


where we have assumed co — 0 with respect to Minkowski space coordinates and ð À" = 
0. Switching to Z? = (r/rxx)® — 1 and with 


/2 
V Ng f. inte 


Ani ay = (a), (2.23) 


we thus obtain 


Z 
co = f dZ ðzC® 
0 


=4ra dota yN = arctan( (Z)mo(x) [ó (T) +6 (7 —7)]. (2.24) 


The “anomalous” part co contributes to the 0 parameter, 


o=] Q=}! Fy = iz? [ draco (cio sp Cs 


Sr cigar 
2mo/ Ak? ug / Ng 2N; 
=v | =v EEE : 2.25 
(x) afl, mo (x) = v(x) f. mo (2) (2.25) 


In the presence of flavor branes, the 0 parameter is therefore no longer given by V alone, 
but also involves no. 

In the remainder of this work we will set the 0 parameter to 0, which corresponds to 
non-vanishing (x) according to 


2N; 
fr 


v(x) = — no (a). (2.26) 


The meson field no therefore also appears in the non-normalizable mode c which is a 
fundamental ingredient in the realization of the Witten-Veneziano mechanism in the WSS 
model. 

Note that, in the chiral case, a constant 0 can be absorbed simply in a field redefinition 
No — not f40/ y2N; , since only derivatives of 79 appear in the effective action produced by 
Sp However, introducing mass terms for quarks either through world-sheet instantons 
or nonnormalizable modes of bifundamental fields corresponding to open-string tachyons 
[34-39] produces the additional term 


LM x f dtz Tr (M U (x) 4- h.c.) , (2.27) 


where such a redefinition changes the phase of the quark mass matrix M = diag(m,, mq, Ms) 
according to M — Me?®/Nr, 

Both, in the chiral limit and in the case with nonzero quark masses, the singlet no 
receives an extra mass term that is determined by the topological susceptibility of pure 
Yang-Mills theory obtained in (2.15), 

2N; 


ma = F Xo (2.28) 


in accordance with the Witten-Veneziano formula [4, 5]. 


3  Pseudoscalar glueball-meson mixing 


With the additional term c x no we have solved the anomalous equations of motion.? 
Now we will determine the field redefinitions that are necessary to obtain a diagonal action 
for the Minkowski space fields yo and G. In doing this we encounter divergent terms 


? As stated after (2.22), we had to assume CÓ) = 0 with respect to Minkowski coordinates. With ro 
picking up the mass m$, this assumption is violated, but only at higher order in Ny: CO ~ N P : No x 


Ny = (With nonzero quark masses, this seems safe as long as their contributions to the masses of the 
pseudoscalar mesons are much smaller than mo.) 


proportional to 6(0) in analogy to the Hofava-Witten calculation [40]. By adding to the 
Lagrangian terms beyond the probe approximation one would presumably be able to cancel 
these divergences. In the following we will however just drop terms proportional to 6(0), 
i.e., the (c2 term in Sp and the co term in Scg. 

Let us start with the effective kinetic terms coming from the first part of (2.13), 


st "Em ul. di poor d'a 91€, 8,6. (3.1) 
With C, = C9 + c9 +0 and cO (r, x) x G(x) we write them as 
oe f dx Ca + Cdumd"G — 50,0") l (3.2) 


(2) 


where we have fixed the normalization of the radial mode functions in C7 


TÌ vc r3 ~\2 1 
a d CO =e: 3.3 
12K?) Mxx i Fr) ( v ) 2 (9) 


For the constant $1 which corresponds to a wave function renormalization of ro we obtain 


by requiring 


T? % HOU Nj- 
ee a, (0 | 909 CO p2 — NF a 
a — 12€) Mkk [ul or fr) (ct Ter a [o veoh (3.4) 


where C, is divergent, since it is obtained by integrating non-normalizable modes, but (; is 
suppressed by a factor N./N,y compared to the kinetic term for no contained in SP8 The 
constant Çə which is associated with a kinetic mixing of 79 and pseudoscalar glueball modes 


is finite and given by 
3 


T Be Loa " 
Be n O d (0) ()) cO) 
Q2 uoa. | "Fir (cs Gr jos / (mã) 


= 0.011180 AES (3.5) 
sü eee 


for the lowest pseudoscalar glueball. (The next-to-lightest, excited pseudoscalar glueball 


has (9 = —0.014314. .. / Nz/N-A.) 


The remaining terms from the background action (2.13) and the CS action (2.20) 


1 lr 
poem s [eo ga (00 
" Ak? 24 [3 
+ pg2ra! f d?r (8 (7) +8 (T —7))- V |gio| Tr (Ar) g^ g^ Or Cr (3.6) 


give the effective mass terms 


mass 1 1 X X 
Se =- 3mo = gee + G37oG, (3.7) 


with the values already determined above, 


mê = i. rd “(a CO In l = AN (3.8) 
a Pim DIAA en) eN, BE 
and (for the lightest pseudoscalar glueball mode) 
TVA a lr’ ~\2 
Me = 4 | dra (00M /G) = 3.5539Mg. 3.9 
b= ae]. tris (600 ix (3.9) 


The terms involving 6,09, [c + c cancel by the equation of motion (2.22) with the 
CS term. The mass mixing term 9G vanishes because 


C3 ex m dr 77,0 A,02 LIN dr ðC? (3.10) 


as already pointed out in [14]. 
(50) 


However, the term involving C; in (3.5) has produced a nontrivial kinetic mixing 


term of order (Ny; IN. To get rid of such mixing terms one has to perform a non-unitary 
field redefinition. In the present case, one has to make the substitutions 


no > (1 + Gi) rio + GG, 
Go (1 + 56) G, (3.11) 


which yield 


ilin. 1 1 2 ^y 1 
Lr ES -3 Qno)? = 5 (9.6) + 0,190" G (201% ar 56) 
1 1 z 
-3 Mél + G)G* — smong — momo + O(NT/N?) (3.12) 
1 1 1 A 1 
—5 (uno)? - 2 (0,8 — 2 M&( + G)G? — zmn + O(NS/N2P)), 


where we have taken into account that C9 x 4/Nf/N. while Cierny, x N;/N.. The field 
redefinitions (3.11) thus diagonalize the bilinear terms up to and including order N;/N., 
yielding also a small positive contribution to the pseudoscalar glueball mass term, 


Mà, = (1789.0 MeV)? — (1 + (0.011182)? N5/N.) MẸ = (1819.7... 1806.5 MeV)? (3.13) 


for A = 16.63...12.55. 


Dropping all terms of order (Ny/N-)3/? 


and higher, we see that the divergent coefficient 
Çı can be ignored and that (3.11) reduces to 


~ N u 
no — lo + C2 G = No + sors Ata e 
Cc 


G 5G. (3.14) 


Note that this is very different from the mixing scenario proposed by Rosenzweig et 
al. in [20, 21] and also considered in [23]. In that scenario it is assumed that the chiral 
anomaly is not saturated by so alone, but only together with the pseudoscalar glueball 
field?. This leads to a nondiagonal mass matrix for nọ and the pseudoscalar glueball which 
can be diagonalized by an orthogonal matrix. The field definition (3.11), on the other hand, 
is nonunitary. It is needed to remove the kinetic mixing term (3.2), but originally there is no 
mass mixing term because of (3.10); a mass mixing term appears after the field redefinition 
in (3.12), but only at the order NY : INÈ /2 which is beyond the probe approximation.4 

In Ref. [14], where (in view of the vanishing mass mixing term) the interactions of the 
unmixed pseudoscalar glueball were considered, it was found that those are given either 
by pairs of pseudoscalar glueballs interacting with scalar or tensor glueballs or by a vertex 
connecting a pseudoscalar glueball with ro and a scalar glueball. The latter, which is 
relevant for the decay of a pseudoscalar glueball, is due to the fact that the integral (3.10) 
for the noG mass mixing term no longer vanishes when metric fluctuations dual to a scalar 
glueball are inserted. Assuming that this is the dominant vertex, a very narrow pseudoscalar 
glueball was predicted whose decays had to involve 7 or 7 together with the decay products 
of a scalar glueball. 

Through the shift (3.14) a pseudoscalar glueball acquires also all types of vertices 
that the singlet 79 possesses. In the DBI part of the D8 brane action, such vertices are 
contained only in terms involving F to fourth and higher power, which are suppressed by 
higher powers of o/. In the chiral limit, the dominant interactions come from the CS part 
of the D8 brane. With quark masses introduced according to (2.27), further interactions 
arise from (assuming M = MÌ) 


Tr (M(U + U1)) > Tr (M(U +U1)) + if. A/2/Ng G Te (M(U — U1)). (3.15) 
The additional term in (3.15) also contains a bilinear term involving G and 10,8; 


Xx | 1 1 2 
(m+ 18) m — af 20 — m) 


un 


It implies a further mixing of G with jos Which turns out to be negligible as concerns the 


AL?) = (3.16) 


masses of 7, n’, and G when the mass matrix of the G-no-ng sector is diagonalized. Due 
to the large mass of the pseudoscalar glueball, the corrections to the masses of rj, r/', and 
G are only about —0.015%, —0.00896, and +0.09%, respectively, and thus can be safely 
neglected. Also the vertices induced by the kinetic mixing of G and no receive only small 
corrections of the order Com? / m2, « (5. However, the small mixing of G with Ng gives rise 
to new types of vertices from the DBI action. 

In the next section we shall consider the consequences of all these additional interactions 


in turn. 


3 According to [20], this additional field could however also be interpreted as a radial excitation of 7’. 

^In [22-24] a unitary mixing of 7’ and pseudoscalar glueballs has been considered from a phenomenologi- 
cal point of view, which has an important impact on the determination of the pseudoscalar mixing angle 0p 
when there is significant 7/-G mixing. By contrast, the nonunitary transformation (3.14) that we obtained 
does not have this effect (to leading order). 
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4 Pseudoscalar glueball decay modes 


4.1 Decay into two vector mesons 


The dominant decay channel of the pseudoscalar glueball mode turns out to be a decay into 

two vector mesons. This decay arises from the Chern-Simons term and is mediated by the 

mixing of yo with the glueball G according to (3.14). Before the field redefinition we have 

the interaction term 

(2o/? 
3! 

(27ra") 


3 
5 n SOT sprawe |a (Az AL, As) , (4.1) 


She 2 us [REF AFAC) 


with Az oc no, which after the field redefinition yields the term 
Bs 2 ki Ge"? TrO, v, Opve, (4.2) 
with the coupling constant 


1 2o)? 
heu in a O / AZ bowt 
V Nr; 29s 
= & x 817.39 92 NL NP et 


= 9.8092 MENZ !A^?, (4.3) 


for the lowest vector meson mode y1. For the lowest axial vector mesons with radial mode 
function w2 with mass square eigenvalue m2, N 1.57MŻk z (1190MeV)?, which is quite 
close to the experimental value of 1230 MeV of the lightest a; axial vector meson,? we 
obtain instead kg = 4.8888 MghNy 3X72. 


Performing the polarization sums we obtain the amplitude squared 


»3 IM(G > 27 = 8k? M? (+ = m?) l (4.4) 


€1,€2 


where M and m are the masses of G and v (and similarly for the heavier vector and axial 
vector modes when M is large enough). 

'The partial widths for decays into the various vector mesons are listed in table 1 for 
A = 16.63...12.55 and glueball mass Mg given alternatively by the WSS result (3.13) and 
by an (admittedly speculative) extrapolation to the lattice prediction of 2600 MeV, where we 
have assumed that the mixing parameter C? and thus kı (which has inverse mass dimension) 
scales like Mc ! when the glueball mass is raised. (Keeping the mixing parameter as is, the 
widths for a 2600 MeV glueball would all be a factor of about 2 larger.) Also with the 
assumed reduction of kı, a 2600 MeV pseudoscalar glueball is thus projected to be a rather 
broad resonance. 


°In table 1 we ignore the slightly heavier fı and Kı axial vector mesons, which would start contributing 
significantly only for Mg well above 2600 MeV. 
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Tp(Ma=1813+7MeV) Tp(Ma=2600MeV) 
pp 36.8... 45.0 190. . . 248 
ww 11.4... 13.8 62...81 
K*K* 2.7...1.8 189... 246 
od - 29...38 
ajay - 3.1...4.0 
|» 51...61 473...618 


Table 1. Partial decay widths in MeV for the decay of a pseudoscalar glueball in two vector mesons 
with à = 16.63... 12.55 and Mg given by the WSS result (3.13) and also when extrapolated to the 
lattice prediction of 2600 MeV with reduced mixing (see text). 


4.2 Decay into three pseudoscalar mesons 


In Ref. [41], Gounaris and Neufeld have proposed that a pure pseudoscalar glueball decays 
predominantly through an interaction Lagrangian involving iG Tr (M(U — U1)) in order to 
explain that the pseudoscalar glueball candidate at the time, 1(1460),° seemed to decay 
mainly into K Kx. Decays into nm would thus be suppressed by a factor (mz/mx)*. 

In sect. 3 we have found that such an interaction Lagrangian is in fact generated in 
the WSS model with nonzero quark masses by the kinetic mixing of 79 and pseudoscalar 
glueball modes. Explicitly, it reads 


ALm = ito — 2 G' Tr. M(U — Ut), (4.5) 
2./2N; 
with 


M = diag (m2, m2, 2m — m2) (4.6) 


in the isospin symmetric case Mu = mq. This contains vertices of the pseudoscalar glueball 
with K Ks, K Kn(), and n(")? of the order of (2m2./ f2, and vertices with (')mm propor- 
tional to (9m2/ f2. 

In table 2 we list the resulting partial decay widths for a pseudoscalar glueball with 
mass M = 1813 7 MeV according to (3.13) and, as above, with mass extrapolated to 2600 
MeV as predicted by quenched lattice QCD. 


In chiral perturbation theory, a well-known problem is that the leading-order La- 


grangian severely underestimates the decay 7! — nam whose leading-order amplitude is 
proportional to m2, to wit, |M(n! — nrr)| = |22cos(20p) — sin(20p)|(m;/ f.)?/6. This 
accounts for only about 3% of the experimental result. Higher-order terms in chiral pertur- 
bation theory which are not present in the WSS model have been shown to give contributions 
which are much larger |45, 46]. By the same token, the partial decay rates of a pseudoscalar 
glueball into three pseudoscalar mesons that result from its mixing with no can be expected 
to be strongly underestimated by (4.5), in particular the amplitudes for decay into nar, 


°This is now listed as two states (1405) and 9(1475) by the Data Particle Group [42], of which the 
lighter one is still considered occasionally as a glueball candidate [43] in view of its supernumerary nature 
with respect to the quark model, while others question the existence of two separate states and argue for a 
single 7(1440) [44]. 
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D;(Mga—1813x7MeV) D; (Mga-—2600MeV) 

K Kn (w/o K K*) 0.398. . . 0.387 0.558. . . 0.550 
K K (w/o fo(1710) etc.) 0.0263. . . 0.0064 0.1092. . . 0.0285 
KKmw - 0.3303. . . 0.3570 
TTN 0.0048. . . 0.0061 0.0049. . . 0.0061 
nry! 0.0011. . . 0.0007 0.0021. . . 0.0013 
nnn 0.0039. . . 0.0007 0.0337. . . 0.0064 
quy - 0.0564. . . 0.0277 
may - 0.0047. . . 0.0048 
PP (w/o fo(1710)n) 0.036. . . 0.014 0.540. . . 0.431 
fo(1710)n — PPRC” [14] 0.0068 . . . 0.015 2.5...4.3 


Table 2. Partial decay widths in MeV for the decay of a pseudoscalar glueball in three pseudoscalar 
mesons resulting from (4.5) with \ = 16.63... 12.55. The results for Č — K K do not yet include 
the decays G —^ KK* — KKr) discussed in sec. 4.3, and those for Č — PPn(') with P a 
pseudoscalar meson neglect G — fo(1710)n — PP). For comparison, the decay modes for an 
unmixed pseudoscalar glueball via G — fo(1710)n(’) as obtained in Ref. [14] are evaluated in the 
last entry of the table for the same set of parameters. These latter results are however incomplete 
in the mixed case (see text). 


which are proportional to m2. The other decay amplitudes are of the order of m3, but 


since mg is small compared to mg, formally higher-order contributions could again be 
much more important. However, even if we assume that these partial decay widths are 
underestimated by an order of magnitude, they are still much smaller than the decays into 
two vector mesons. 

In Ref. [14], one of us with F. Brünner has calculated the decays of an unmixed pseu- 
doscalar glueball into three pseudoscalar mesons in the WSS model, which necessarily in- 
volves ro and a scalar glueball. If the latter is identified with fo(1710), which decays pre- 
dominantly into kaons (as is predicted [47, 48] by the WSS model when scalar glueballs have 
no or negligible decay width into nn’), the dominant channel is Č — fo(1710)n > K Kr, 
whereas KKr decays are excluded. With mixing, additional amplitudes for G > PP7(') 
arise from Cait and G acalssstensor( Quin)" terms in the DBI action and also from the 
additional terms involving co in Sg. They would have to be included in a complete cal- 
culation of PPr() decays. To give an idea of the magnitude of the contributions from 
G — fo(1710)n(’) + PPn('), in table 2 we have included the partial widths of these decay 
modes as obtained in the unmixed case in [14]. While such contributions are moderate for 
the pseudoscalar glueball mass (3.13), they become important for Mg extrapolated to the 
mass predicted by lattice which is well above the threshold for f9(1710) 4-7 decays. However, 
also these contributions are always much smaller than those for the decays G — vv. 


4.3 Decay into one pseudoscalar meson together with one or two vector mesons 


The very small mixing of a pseudoscalar glueball with ng that is induced by the quark 
mass term (3.16) also gives rise to vertices with one pseudoscalar meson and one or two 
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I, (Mg=1813+7MeV) T,(Mg=2600MeV) 
KK* 0.381. . . 0.288 0.302. . . 0.225 
"K'K* - 0.0113... 0.0112 
KK*p - (2.50.,,2 47) 1073 
KK*w - (0.799. . . 0.787) x107% 
KK*ó - (0.253,..0.249) 10^? 
nK*K* - (0.097. ..0.026) x 107? 
Table 3. Partial decay widths in MeV for the decay of a pseudoscalar glueball into one pseudoscalar 


meson and one or two vector mesons. 


vector mesons due to the terms involving Tr (OIL, [V^, II]) and Tr ([V,,, II]|V", IT]) in the 
Yang-Mills part of the effective action. When the glueball mass is above the respective 


threshold, this gives rise to decay modes 
G— KK*,cK* K',KK*o,K K'w,KK*ó,n)K* K*, (4.7) 


where K K* is short for K K*, K K* etc. The corresponding amplitudes are proportional to 
(m? — m2)/ m2, ~ 107? times a factor gos, X AMANDI? or gporr X ATING S 

In table 3 the resulting partial decay widths are listed. While the partial width for a 
decay of the pseudoscalar glueball into K*, one further vector meson and one pseudoscalar 
meson are rather small, the partial width for G > K K* (and subsequently + K K) turns 
out to be comparable to that of the direct decay Č — K Kr evaluated above. Since we 
assume that the latter may be underestimated significantly, we have not carried out a full 
calculation of G > K K combining coherently both amplitudes. 

In summary, we are finding a rich pattern of decays of the pseudoscalar glueball in 
three or more mesons, which involve vertices which are proportional to pseudoscalar meson 
masses. The by far dominant decay modes are however given by two vector mesons. 


5 Discussion and outlook 


If the mass of the pseudoscalar glueball is as given by the WSS model, (3.13), the pseu- 
doscalar glueball is predicted to be a rather narrow resonance with relative width T/M ~ 
0.03...0.04. However, the WSS model appears to underestimate significantly the mass of 
heavier glueballs. The tensor glueball comes out as 1487 MeV, while it is predicated as 
around 2400 MeV by both quenched and unquenched lattice QCD [49-53]. Extrapolating 
our results to a pseudoscalar glueball mass as indicated by (quenched) lattice QCD, 2.6 
GeV, the prediction is instead that of a very broad resonance. In fig. 1 we display two pos- 
sible extrapolations, one with unchanged vertices and mixing, which leads to an extremely 
large total width, and the somewhat more moderate scenario (underlying the above tables) 
where the mixing parameter Cə is assumed to be decreasing like Mwss/M. 

The rather large decay width of a pseudoscalar glueball to two vector mesons that we 
obtained from the WSS model also implies a significant coupling to photons because of the 


vector meson dominance that is inherent in the WSS model [10]. This also applies to the 


— 14- 


F/M 


Figure 1. Relative pseudoscalar glueball decay width T/M for A = 16.63 (blue) and 12.55 (orange) 
with two different extrapolations to larger masses: same vertices and mixing (full lines) or when 
the mixing parameter Çə is assumed to be decreasing like Mwss/M (dashed lines). 


scalar and tensor glueballs studied before in [32]. Radiative glueball decays will be studied 
in detail in forthcoming work, as well as the contribution of glueballs to hadronic light- 
by-light scattering, which is of relevance to the theoretical prediction for the muon g — 2, 
where holographic QCD has been argued to provide a promising framework for quantitative 
predictions [54]. 

An important implication of the mixing-induced vertices determined in this work is 
that contrary to the unmixed case [32] the WSS model predicts sizeable vertices for the 
production of single pseudoscalar glueballs in double diffractive scattering. While Regge 
regime hadronic scattering requires extensions and extrapolations of the WSS model [55- 
57], the structure of the vertices with Reggeons and Pomerons are completely determined 
by the Chern-Simons action and the mixing of the pseudoscalar glueball with rg. For 
Reggeons, the vertices have the form (4.2), while Pomeron vertices can be derived from the 
A-roof genus factor in (2.19) as in [55]. This leads to' 


Los 2 Ge?" | ra Pads P^ + ye OL Pug (823 Pap — IoDaP%,)| 6-1) 


with C2ka Mk, Cony Mg x ACUTN Oe The results for the unmixed case of ref. [32] instead 
implied that central exclusive production of pseudoscalar glueballs required the forma- 
tion of pairs GG, n(’)G, or Ü wagen with vertices of order A! N77, NV Ne, Or 
ATIN tNe 3, respectively. The possibility of production of single pseudoscalar glueballs of 
course lowers the threshold significantly. 


"Up to partial integrations, the tensorial structure of the two terms in (5.1) is in fact in one-to-one 
correspondence to the two couplings Jop yy and Sep x, for two tensor pomerons with one pseudoscalar meson 
in [58]. 
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Similarly, the kinetic mixing of the pseudoscalar glueball with jg allows the production 
of single pseudoscalar glueballs in radiative J/1» decays with rates of the parametric order 
Nr! times that of the decay rate for J/V — Yne- 

However, the (semi-)quantitative results that we obtained within the WSS model sug- 
gest that the experimental identification of the pseudoscalar glueball may be made difficult 
by a very large decay width if its mass is around 2.6 GeV as indicated by (quenched) lattice 
results. In future work we intend to further explore these predictions in the context of 
radiative J/y~ decays and central exclusive production in the Regge regime. 
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Abstract 


We calculate a new contribution to the axion mass that arises from gluons prop- 
agating in a 5th dimension at high energies. By uplifting the 4D instanton solution 
to five dimensions, the positive frequency modes of the Kaluza-Klein states generate 
a power-law term in the effective action that inversely grows with the instanton size. 
'This causes 5D small instantons to enhance the axion mass in a way that does not 
spoil the axion solution to the strong CP problem. Moreover this enhancement can 
be much larger than the usual QCD contribution from large instantons, although it 
requires the 5D gauge theory to be near the non-perturbative limit. Thus our result 
suggests that the mass range of axions (or axion-like particles), which is important 
for ongoing experimental searches, can depend sensitively on the UV modification 
of QCD. 


1 Introduction 


The axion is arguably the best motivated new particle beyond the Standard Model (SM). 
Its existence is required by the Peccei-Quinn mechanism [1], which is a popular solution to 
the strong CP problem. In particular, the axion is identified with the Nambu-Goldstone 
boson [2, 3] that arises from a spontaneously broken U(1) symmetry. This symmetry is 
explicitly broken by QCD instantons, which generate a nonzero axion mass and makes 
the axion a viable dark matter candidate [4-6]. A large experimental effort is devoted to 
searching for the axion, and therefore knowing the mass range of the axion is important. 

The origin of the axion mass can be traced to the so-called large instanton contribu- 
tions! in QCD. These are contributions to the path integral that arise from instantons of 
size p ~ 1/Aacp, where Agcp is the QCD strong coupling scale. This IR contribution 
dominates the integration measure over the instanton collective coordinates because the 
theory is asymptotically free and therefore instantons of much smaller (UV) size give negli- 
gible contributions to the non-perturbatively generated axion potential in QCD. However 
this implicitly assumes that the QCD coupling remains asymptotically free in the UV and 
QCD dynamics is not modified below the Planck scale. 

Thus in attempts to enhance the axion mass it is natural to speculate on possible UV 
modifications of QCD dynamics, such as those considered in Refs. [8-16]. Of particular 
interest to us will be the possibility proposed in [17-19], where QCD is strongly coupled 
in the UV and thus enhances the small instanton contributions to the axion potential. 
In this paper we explicitly construct such a UV modification by embedding QCD in a 
five-dimensional (5D) theory. In addition to the bulk QCD gluons, we identify the axion 
with the 5th component of a U(1) gauge field, while the axion-gluon coupling arises from 
a 5D Chern-Simons term. 

This UV modification of QCD implies that the axion mass can now receive contribu- 
tions from 5D small instantons. The 5D instanton solution is obtained by simply uplifting 
the usual 4D instanton [20] to five dimensions. This gives a finite 5D action provided the 
extra dimension is compact (of size 7R), and leads to a well-defined semiclassical expan- 
sion of the path integral around this solution [21]. The axion mass contributions can then 
be calculated in the perturbative limit by restricting the number of Kaluza-Klein modes. 
As was shown in [22] using deconstruction, besides the usual logarithmic terms present 
in the effective action in the instanton density, there is a power-law term R/p that arises 
from the positive frequency modes of the Kaluza-Klein gluon states. We show that this 
result can also be obtained by performing a fully 5D calculation of the Kaluza-Klein con- 
tributions to the effective action which is just a 5D version of 't Hooft’s computation [23]. 
The power-law term in the effective action can be sizeable for small instantons (p < R), 
leading to a possible enhancement of the axion mass. 

This new contribution to the axion mass can be compared with the usual low-energy 
QCD contribution from large instantons. Interestingly, we will see that the 5D small in- 


‘Large instanton contributions correspond to the strong coupling regime of QCD and thus are not 
calculable. However, using chiral symmetry one can relate the axion mass to the equally incalculable but 
experimentally known pion mass, see for example [7]. 


stantons can provide the dominant contribution, but at the expense of the 5D theory being 
near the non-perturbative limit. In addition the enhancement is maximized only when the 
SM fermions are confined to the boundary of the extra dimension. Under these conditions 
we find that the axion mass can be enhanced by many orders of magnitude, depending 
on the size of the extra dimension. Since the 5D theory is near the non-perturbative limit 
we also consider the impact of higher dimension terms in the 5D Lagrangian, and show 
that they lead to noticeable but controllable effects provided the scale suppressing the 
higher dimension terms is smaller than the scale at which the 5D theory becomes strongly 
coupled. Our results for the axion mass have consequences for the experimental efforts 
searching for the axion (or axion-like particles, in general) with the conclusion that large 
regions of parameter space could remain viable. 

The outline of this paper is as follows. In section 2 we present our 5D model for a 
pure Yang-Mills theory. The 4D instanton solution is then uplifted to five dimensions 
and shown to give a power-law term in the effective action in section 2.1. The effect 
of including higher dimension terms is discussed in section 2.2, and the contributions 
to the axion mass are calculated in section 2.3. In section 3 we consider the fermion 
contributions to the axion mass for the case of boundary fermions (section 3.1) and bulk 
fermions (section 3.2). In section 3.3 we compare the dynamics of 5D small instantons 
with the small instantons of 4D moose models introduced in [24, 25]. Our concluding 
remarks are given in section 4. Appendix A contains the details of the 5D calculation 
of the Kaluza-Klein contributions to the effective action, while the calculation performed 
using the 4D deconstruction is summarized in Appendix B. 


2 5D Instantons and the Axion Mass in a pure Yang- 
Mills theory 


We will consider a 5D spacetime (x",y) where the 5th dimension, y is compactified on 
an orbifold of size L = mR with the QCD gauge group SU(3). in the bulk. The bulk 
QCD gauge boson Ay (M = p,5) will have (+,+) boundary conditions for the A, 
components, while the A; components will have (—, —) boundary conditions.” The QCD 
gluon is thus identified with the zero mode AQ. In addition, the bulk contains a U(1) 
gauge group where the U(1) gauge boson By has (—, —) boundary conditions for the 
B, components, and (+,+) boundary conditions for the B5 component. This ensures 
that there is a massless pseudoscalar zero mode B® (to be identified with the axion), 
whereas the Kaluza-Klein (KK) scalar modes (B5) are eaten by the KK U(1) gauge 
bosons to become massive. We will start by first considering the pure YM case without 
any fermions. 

In order to generate an anomalous axion coupling to gluons below the compactification 
scale 1/R, a bulk Chern-Simons term must also be added. The 5D Lagrangian of the 


?The notation (-,-) refers to either Neumann (+) or Dirichlet (—) boundary conditions at y = 0 (first 
entry) and y — L (second entry). 


SU (3). x U(1) theory with a Chern-Simons term is given by 


1 
— Fin e ’ 


Ag Z 

(1) 
where Gyn(Fyy) is the gluon (U(1)) field-strength tensor, bog is a dimensionless con- 
stant and we have equally normalized the non-Abelian and Abelian gauge fields with g; 
the (dimensionful) gauge coupling. The 5D gauge theory has a UV cutoff A; whose max- 
imum value occurs where the theory becomes strongly coupled, g2^5/(247?) ~ 1. Higher 
dimension terms in the Lagrangian are expected to be suppressed by A5, and for now they 
have been neglected in (1). Later we will see that they can have an important effect on 
the 5D instanton. Note that a 0 term is not allowed in the 5D action (1) due to Lorentz 
invariance, but can be present on the 4D boundaries. However the U(1) symmetry in 
(1) can be used to eliminate these boundary 0 terms. This is one of the distinguishing 
features of our 5D model compared to that of the 4D moose models [24, 25] where there 
is a theta angle for each SU(3) gauge group, and therefore one has to also introduce an 
axion at each site. 

Upon compactification we obtain the effective 4D Lagrangian 


E 
--f d [ dy (zz z Tr[GA; s] + o oN Bw'TrIGwnGsr] + 


EIE (Eme, | zs ; 5 Ira, C] + jay +...) (2) 


where g, is the 4D QCD gauge coupling, Gy, = GQ is the QCD gluon field strength 
tensor, a = B® /gs is the axion®, and the couplings are identified as 
1 X 1 
9 99 f 


= besgsL . (3) 


We will assume that the 5D cutoff of the model, A; lies at or below the strong coupling 
scale ~ 247°/g?. Using (3) this translates into the limit 


ree. (4) 


Os 


where a, = g2/(47) and for an orbifold, L = m R. Thus for a, ~ 0.1 we obtain A5 R < 200. 


2.1 5D Small Instantons 


The extra dimension provides a UV modification of QCD at the scale 1/R > Aocp, 
where Agcp œ 300 MeV is the QCD strong scale. It is thus possible that instantons of 
size SS R can give large contributions to the axion mass. Let us consider first how the 


3 Alternatively the axion could be a localized boundary field that couples to Tr[G Nei Our analysis 
also applies in this case. 
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Figure 1: € as a function of R/p. 


instanton calculus is modified in the 5D pure YM SU(3) theory. The compactified 5D 
theory admits the following instanton solution in Euclidean space: 


A (z,y) = AU" (a), As(z, y) = 0, (5) 
where jen 
ue oce (6 


=m +p? * 
is the 4D instanton configuration [20] in the regular gauge with center x and size p. The 
tensors 5, are the group-theoretic 't Hooft eta-symbols [23] and a denotes the gauge 
isospin index. The 5D instanton solution (5) can be simply thought of as wrapping the 
AD solution (6) around the compact dimension. 

Note that in the deconstructed version of [22] the above 5D solution corresponds to a 
multi-instanton configuration with winding numbers (1, 1,..., 1). Importantly, it does not 
appear to be the continuum limit of the 4D instantons used in the moose model of [24] that 
corresponds to the combination of (1,0,0...,0)+(0,1,0...,0)+(0,0,1...,0)+.... This 
latter configuration would correspond to instantons localized in the bulk i.e. A (x,y) = 
A‘ (x)6(y). However this is not a solution of the 5D equations of motion, and therefore 
the 4D moose model of [24] does not reconstruct to a 5D theory. 

The 5D instanton solution (5) minimizes the action (1) to give (ignoring for now the 
axion terms) : 

_ E _ 2n (T) 
95 Os 


where we have used the relation (3) with L = mR. To obtain the contribution of the 
instanton to the partition function we must also consider the fluctuations of the 5D 
gauge fields about the instanton solution (5). This means not only including the gluon 


e 


fluctuations but also the KK contributions. The details of this calculation are given in 
Appendix A, and the final result for a pure Yang-Mills SU(N) theory is presented in (41). 
For SU(3) the result is 


R dp ( 2T i K 
— C[3] | —— ) est = —, 8 
Ju. C (gm Ri H 
where C3] ~ 1.5 x 107%, and the effective action is given by 
2m R R 
Se = Se — 3E(R/p)— «-byln— , 


where a,(1/R) is the YM coupling evaluated at 1/R (see (40) for the exact definition) and 
bo = 11 (the pure QCD £ function coefficient) is the contribution from the gauge boson 
zero modes. The function €(R/p) is plotted in Figure 1. The quantity K that appears 
on the r.h.s. of (8) is a dimensionless factor resulting from evaluating the integral in (8). 
Note that since we are only considering the effect of 5D small instantons, the integration 
region in (8) is limited to 1/A5 < p € R. The dependence of the lower limit of integration 
on 1/As can make the contribution very sensitive to the UV completion details. This will 
be further discussed in Section 2.2. 

The result (8) reveals a new, interesting feature. There is a power-law term (R/p) in 
the exponent arising from the KK modes with a positive coefficient, £(R/p) > 0, which 
now causes the integral over the instanton size p to receive a large contribution from the 
small instantons of size, p ~ 1/As. As we will show, in some parameter regions this 
contribution can overcome the IR contribution dominated by large instantons of order 
p^ 1 / Agcp. 

An approximate expression for the dimensionless factor K on the r.h.s. of (8) can 
be obtained by evaluating the integral in (8) and using the fact that €(R/p) ~ 1/3 for 
R/p Z 20. This gives 

ER 27 8 8—bo ,— — rg tAsR 27 i e aam tek 
Ke C{3] (<a) (A; R) e oesü/R) = c[3] Gant (MR)? 
Thus we see that for sufficiently large A5 R, the power-law term in the effective action (9) 
leads to an exponential enhancement that can overcome the suppression from e-?7/es(1/R) 
to give a UV-dominated contribution to the integral in (8). Note that for the calculation 
to be reliable, A;R cannot saturate the bound (4), otherwise higher-loop corrections 
in the instanton background will be equally important. Furthermore the fact that the 
contribution (10) is cutoff dependent suggests that higher dimension terms in the 5D 

Lagrangian are also important and should be considered. 


(10) 


2.2 Higher dimension terms 


To study the impact of higher dimension terms, we consider the addition of the following 
dimension six term to the 5D action: 


1 ss C6 / 
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where cg is a dimensionless constant. We will assume that cg > 0 so that it stabilizes the 
instanton action. Substituting (5) into (11) and performing the 5D integration leads to 


2x ƏT C6 
-+ 
Qs Qs (Aso 


Sen = ; - 3E(R/p)4. +... , (12) 
) p 

where the logarithmic term in (9) has been neglected. Note that the instanton solution 
(5) is itself modified by the order 1/A2 terms in (11). However these corrections lead 
to subleading terms of order 1/(Asp)* in S,g, and can be neglected. Whereas Sag is 
extremized near the UV size 1/A5 when cg = 0, the inclusion of the higher dimension 
term in (12) instead leads to an extremum 


- ~ S e(njp) (35) s, us 


where the p dependence in £(R/p) has been neglected since it is approximately constant 
for p « R. As long as the theory is perturbative at the cutoff, g2A5/(245?) < 1, 
the extremum condition (13) implies p, >> 1/As, and therefore the contribution (8) is 
dominated by instantons of size p,. As alluded to earlier, this means that the instanton 
size is effectively cutoff at p., and the factor K is approximately given by the expression 
(10) with A; replaced by 1/p,. Of course there is no need to rely on the approximate 
expression, and one can simply perform the numerical integration in (8) to obtain the 
exact factor K. To reiterate the salient point, the integral in (8) with the higher dimension 
term (11) included, is dominated by instantons of size p, where the 5D theory remains 
perturbative, and therefore contributions from instantons of size 1/A; are suppressed. 
Furthermore, higher dimension terms (beyond those of (11)) can be neglected, as they are 
suppressed by higher powers of dem « 1, and the calculation remains under theoretical 
control. 


2.3 5D Small Instanton Corrections to the Axion Mass 


To calculate the contribution to the axion mass from the enhanced instanton density, 
we next add the axion field. For a constant background axion field, a, in the instanton 
background (5) we obtain 


a 1 
$— 
f 321? 


nz TriG OG Ow = z (14) 


where the winding number is one. The effective action (9) is then modified by replacing 
Set — Seg — ia/ f. Summing over both instanton and anti-instanton contributions in the 
dilute instanton gas approximation [26, 27], leads to 


eGCl1r 4 ia\ T a Kas 
n,n-0 k=1 k=1 
K a 
= exp D d*x cos (5) (15) 


We conclude that the contribution of the 5D small instantons to the axion mass is 
2_ 2K 
i f? RA^ 
where K is defined in (8) and f is given in (3). 
To understand the importance of the 5D small instanton contributions to the axion 


mass, it is instructive to compare these UV contributions with the IR (large) instanton 
contributions, which for the pure YM case are estimated to be m? js ~ A75/ f^, and 


(16) 
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is defined as the IR scale at which the gauge coupling becomes strong. The axion mass 
ratio is then 


Ma 27 : 1 =3 Ta TR 7B 

EOS ~ 2C'[3] Gan (A Ri (iai^ R) (18) 
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where we have used the approximation (10). It is clear that ma 2 


= Marr when the 
exponent in (18) is = 0. By writing 


67€ 

A;R = a (1/85 ' (20) 
where e S; 1 using the perturbativity condition (4), we obtain a positive exponent in (18) 
for e = 7/33 ~ 0.21. This shows that a large contribution to the axion mass from 5D 
small instantons requires values of A;R near the non-perturbative limit. In Figure 2 we 
plot the mass ratio Ma/Ma rr using the exact result for K, and taking different values 
of the perturbativity parameter e defined in (20). We see that in order to have a large 
contribution to the axion mass we must be quite close to the non-perturbative limit e ~ 1. 
In the limit in which the theory is strongly coupled at As (ce ~ 1) and the instanton 
contributions are dominated by instantons of size 1/A;, we cannot reliably calculate the 5D 
instanton contribution to the axion mass. However a naive dimensional analysis estimate 
gives i 
iit, di * ’ 
up to an order-one constant. This estimate corresponds to the maximum value of the 

axion mass from 5D instantons, and is shown as a red line in Figure 2. 


(21) 


3 Fermion contributions and QCD axion mass 


So far we have considered a pure YM theory. We next introduce fermions in order to 
discuss QCD in the SM. In five dimensions the fermions are not chiral, and thus to 


Figure 2: The axion mass ratio for the pure YM case as a function of ArrR for various 
contours of e = (0.3, 0.25, 0.2) (top to bottom). The solid blue lines are the exact numerical 
integration results using (8) with no higher dimension terms (cg = 0). The green dashed 
line represents the addition of the higher dimension term (11) with cg = 0.5 and e = 
0.52(0.47) for the upper (lower) line. The red line represents the maximum enhancement 
using Ma s that would occur at the 5D strong-coupling limit. 


reproduce the SM matter content one must impose boundary conditions that project out 
the unwanted Weyl components of the Dirac fermions. Alternatively, one can assume that 
all SM matter fields are confined to the boundary of the extra dimension. Indeed, the 
assumption that all SM fermions are on the boundary is essential for our purposes: as 
explained in [22] the sign of the R/p term responsible for the enhancement of the small 
instanton contribution flips if the number of bulk fermions is sufficiently large. As shown 
in Appendix A, in the 5D orbifolded theory this happens when Ny > 9N/4. Thus in 
section 3.1 we will first consider boundary fermions (identified with SM matter fields), 
and calculate the small instanton contributions to the axion mass assuming the 5D theory 
is in the perturbative regime. We will also estimate the axion mass when the UV cutoff 
is at the strong-coupling scale ~ 247°/g?. Bulk fermions will then be discussed in section 
d. 


3.1 Boundary fermions 


If the SM fermions (as well as the Higgs field) are on the boundary and SU(3) is in the 
bulk, the instanton integral remains the same as (8) except that bọ = 7 (to account for the 
fermion zero modes). Furthermore, fermion zero modes would naively lead to an extra 
suppression factor in the instanton density of the form (pm)! where my are the fermion 


m Ys 


Figure 3: Instanton vacuum diagram arising from closing the SM fermion legs with Higgs 
loops. 


masses and Ny the number of flavors. However since the fermion masses in the SM arise 
from a Higgs mechanism, the fermion legs in an instanton vacuum diagram can be closed 
with a Higgs loop (see Figure 3). Thus the suppression is only proportional to the Yukawa 
couplings and loop factors, namely: 


_ Yu Ud Uc Ys Ut Yo 


— x 107% 22 
Ff An 4m 4r Ar 4r 4n , (22) 


where Yu,dc,s,t,b are the SM Yukawa couplings. This is one of the ingredients leading to 
the enhancement of the axion mass in the 4D moose models of [24, 25], as well as in our 
5D model. 

With the introduction of fermions, the axion mass low-energy contribution can be 
unambiguously determined from QCD chiral perturbation theory to be [2, 28], 


2 £2 
MuMaq mif 


Myu+tma)? jf? ' 


T QCD = ( (23) 
where m, c 135 MeV, fr ~ 92 MeV, and m,/mq œ 0.46. Using the result (10) with 
bo = 7 and including the factor (22), the axion mass ratio becomes: 


Ma or M (mitm) 1 e fx 8) 
c 4/26 5013 (24 
Ma,QCD vac E73 vm,ma MrfrR? (A5 R)2®-3) (24) 


Here we have not considered higher dimension terms, so the path integral is dominated 
by instantons of size p. ~ 1/As. As discussed in sec. 2.2, the presence of higher dimension 
terms can increase p,, making the result less dependent on the cutoff. Note that in (24) 
the chiral suppression factor Kf is mitigated by the fact that a, runs slower towards the 
UV due to the SM fermions, and therefore as(1/R) is larger, implying that the exponential 
suppression is smaller. Using (20) and approximating the pion and quark masses with 
the QCD scale, we obtain a positive exponent in (24) for e = 0.14. The exact numerical 
result for m, /ma gcp is plotted in Figure 4, where a sizeable enhancement can be seen 


Ma 


Ma,QCD 


1000 108 109 1012 1015 1018 
1/R (GeV) 


Figure 4: The arion mass ratio for the boundary fermions case (assuming as(mz) = 
0.118), as a function of 1/R for various contours of e = (0.3,0.25,0.2) (top to bottom). 
The solid lines are the exact results obtained from a numerical integration of (8) and no 
higher dimension terms (cg — 0). The green dashed line represents the addition of the 
higher dimension term (11) with cg = 0.5 and e = 0.52 (0.47) for the upper (lower) line. 
The red line depicts the maximum enhancement in the strong coupling limit using mj. 


that depends sensitively on e. An enhancement at low compactification scales requires 
larger values of e. 

The maximum possible enhancement occurs when the 5D theory is strongly coupled at 
As. In this case the naive dimensional analysis estimate for the axion mass with fermion 


contributions then becomes R 
m, s, id "rp , (25) 


where the suppression factor «Ky defined in (22) has been included, since we assume that 
there are no other sources of chiral breaking in the 5D model beyond the SM Higgs 
Yukawas. 

The fermion suppression Ky in (25) could actually be removed if we relax this as- 
sumption and consider extra heavy Higgs fields coupled to the SM quarks with order one 
Yukawa couplings [8]. However this comes at the expense of possibly introducing new 
CP phases in the heavy Higgs couplings that could spoil the axion solution to the strong 
CP problem. Even if these heavy Higgs fields are introduced (and for some reason do 
not introduce new phases) the suppression cannot be entirely removed because there is 
a maximum value for the 5D instanton contribution to the axion mass. This arises from 
the fact that 5D small instantons can also contribute to the up-quark Yukawa coupling 
(for instance, yP) that cannot be larger than the experimental value y, ~ Mu/v (unless 
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we tune yD) with the SM Yukawa coupling). By closing the up-quark and Higgs legs of 
this instanton e we can then obtain a two-loop contribution to the axion mass 

I 4 2 
(isn)? a S Won) 
and the instanton (yy?) contributions this would then represent the maximum possible 
enhancement allowed for alleviating the « suppression in (25). 

Finally one may ask why the result of our 5D instanton calculation is interpreted as 
an additional contribution relative to the low-energy expression (23), rather than being 
merged into it. The point is that there are different symmetry breaking parameters that 
control (22) and (23). The chiral perturbation theory result (23) vanishes in the limit 
of vanishing quark masses, Mua — 0, while our instanton contribution to the axion 
mass squared is proportional to the Yukawa couplings yu a (as seen in (22)). In fact the 
instanton result does not vanish in the limit of zero Higgs vacuum expectation value as 
can be inferred from the diagram in Figure 3. The Yukawa couplings explicitly break the 
axial U(1)4 symmetry, and this effect is combined with the anomalous U(1)4 breaking by 
the instanton to give the total instanton contribution depicted in Figure 3. On the other 
hand the chiral perturbation theory contribution to the axion mass results from current 
quark masses which formally is a different source of explicit U(1)4 and the SU(2) chiral 
symmetry breaking. In summary there are two independent contributions to the axion 
mass: one proportional to m,mgq and the other proportional to y,yg, and they should 
both be taken into account. 


of order m? ~ 3 5. Absent a fine tuning between the tree-level (yu) 


3.2 Bulk fermions 


Next we consider the fermions propagating in the 5th dimension. For each chiral SM field 
we need to introduce a Dirac fermion in the 5D bulk. Thus assuming V; quark flavors 
there are 2N; Dirac fermions. The QCD contribution from the gauge boson and fermion 
zero modes still gives bọ = 7, but now there is also a contribution from the massive Dirac 
KK fermions. Using the results in Appendix A we see that for the SU(3) case, we have 
(9) with the replacement 


3€(R/p) > (3 — 4N;/3)&(R/p) . (26) 


Compared to the pure YM case given in (8), the enhancement in the exponential factor 
e ^*t from the power-law R/p term is now reduced as the number of flavors in the bulk 
increases. In fact for Ny = 3 the sign of the power-law R/p term flips, which now 
suppresses the 5D instanton contribution. Furthermore, the fermion zero modes again 
lead to à suppression in the instanton vacuum diagrams due to Yukawa couplings and 
Higgs loops (assuming the Higgs is confined to the boundary). Therefore generically the 
best possible case for an axion mass enhancement occurs when there are no bulk fermions. 

However we would like to point out that the introduction of bulk fermions can increase 
the size of the dominant instantons, p,, and therefore make the 5D calculation less depen- 
dent on the cutoff. Indeed, if bulk fermions have large boundary localized mass terms, 
mp > 1/R, then their KK modes will not contribute to the effective instanton action 
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at scales between 1/R and mg. On the other hand, these heavy fermion KK modes will 
contribute to the effective instanton action at scales above mg. As a result the effective 
action will have an extremum at 1/p, ~ mg that can be chosen to be smaller than the 
strong scale 247? /g?. 


3.3 Relation to 4D moose models 


The contribution of small instantons from a compactified 5th dimension shares some 
features with the 4D moose model [24, 25). In moose models the enhanced contributions 
of small instantons arise due to a high index of embedding of a QCD instanton into 
the gauge group of the microscopic theory [29]. Indeed, in the simplest moose model 
the QCD gauge group, SU(3)., arises as a diagonal subgroup of a larger product group, 
SU (3) x SU(3)2 at some UV scale, with a theta angle and axion at each site. The QCD 
instanton of the two-site moose model corresponds to a multi-instanton configuration 
of the microscopic theory (specifically, the (1,1) configuration). In other words, the 
small instantons in the broken gauge group factors correspond to "fractional" instantons 
of QCD, and their weights, AD and Ae. are large compared to the weight of the QCD 
instanton contribution, Abcr [29] (where b1, b2, and b are the £ function coefficients of the 
corresponding gauge groups). As a result, the small instanton contributions to the axion 
masses in the broken factors are enhanced. Since the lightest mass eigenstate of these 
two axions plays the role of the QCD axion, this may be heavier than predicted by QCD 
alone. The detailed calculation in Ref. [29] shows that the small instanton enhancement 
is not sufficiently large in a two-site model* but could be significant in k-site models with 
k > 3. Increasing the number of sites invokes the analogy between moose models and the 
deconstructed description of 5D theories. 

It is important to note that there is a significant qualitative difference between multi- 
axion moose models and truly 5D theories. In fact, the moose models of [24, 25] do 
not have a 5D continuum limit because they have more than one axion and theta angle. 
Furthermore in truly 5D theories, the enhancement of small instanton contributions is 
not a consequence of a non-trivial index of embedding. Indeed, in the fully 5D theory, the 
gauge group is SU(3) both in the UV and IR, and therefore it is obvious that the index 
of embedding for both small and large instantons is the same. Instead, the instanton 
action has two minima — one in the IR where QCD becomes strong and another in the 
UV where the 5D theory becomes strong. It is the existence of this second minimum in 
the instanton action that is responsible for the axion mass enhancement. 


4 Conclusion 


We have shown that if QCD gluons propagate in a 5th dimension at high energies, then 
the effective action receives a power-law term R/p due to the positive frequency modes 


“This is a consequence of interactions between the Higgs field responsible for breaking the product 
gauge group, and the small instantons in the broken gauge group factors [29, 30]. 
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of the Kaluza-Klein states arising from the uplifted 4D instanton solution. This power- 
law term can cause the 5D small instanton contributions to the axion mass to dominate 
over the large instanton contribution in the IR, and therefore enhance the axion mass. 
However as shown in Fig. 2, the effects are sizeable only near the non-perturbative limit 
of the 5D theory. Therefore higher loop contributions could change our conclusions. We 
have also considered the inclusion of higher dimension terms and shown that they lead 
to an extremum in the effective action at instanton sizes p, >> 1/As5. In this case the 
dominant contribution to the axion mass arises from small instantons of sizes larger than 
the cutoff. 

When the Standard Model fermions are included there is a suppression in the axion 
mass proportional to the product of Yukawa couplings. The axion mass can still receive a 
sizeable enhancement from the 5D small instantons provided the fermions are confined to 
the boundary. Otherwise if fermions propagate in the bulk, the KK fermion contribution 
reduces the coefficient of the power-law term (or can even flip its sign), therefore making 
the enhancement much smaller. 

Importantly the 5D small instanton contribution to the axion mass calculated in this 
paper can preserve the Peccei-Quinn solution to the strong CP problem. This is evident 
because the axion potential arises only from the anomalous coupling a Tr[G,,,G""], and 
not from other sources of Peccei-Quinn breaking. In other words, there is no misalignment 
and the 5D small instanton contributions only scale up the axion potential. Therefore 
the fact that instantons depend on physics at high energy scales suggests that the axion 
mass can also be a sensitive probe of UV physics. 
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A Five-dimensional instanton 


Let us consider an SU(N) gauge theory and study the 5D fluctuations 0A, around the 
instanton solution (5). We choose a 5D generalization of the ’t Hooft gauge [23]: 


1 2 
Lor m Eu [DM A, + 05 As] ; (27) 
5 


where DPA, = 0,Ay — i[AM, A,], is the gauge covariant derivative evaluated on the 
instanton background. This requires a 5D Faddeev-Popov ghost term: 


1 
Lgh = zc [e(—D(»? + 2)c] ; (28) 
5 


where c is the ghost field. Performing a KK decomposition for the fluctuations of the 
gauge field 6A, ghost dc, and 5th component 6 As: 


óÁ,(,y) = 5AM) (a) -+ 73 SAM (a) cos(ny/ R) , (29) 
óc(r,y) = bce (x) + 2» 6c" (x) cos(ny/ R) , (30) 
óAs(x,y) = 2s 8 AC? (x) sin(ny/ R) , (31) 


and replacing A, — AP +6A,, A5 — ôA; and c — óc in the action (1), we obtain at 
quadratic order, and after integrating over the extra dimension: 
I 1 v = 
Sg = $; )- 2g dz Tr [6A M 6AM + 5c My be (32) 
€ Y (GAM MY + miu) + 8E (Men + m2) 5c) + SAP (Ms + m2) ], 


n=1 


where gs is defined in (3), m, = n/R, and the expressions for the gauge boson (gluon) 
operator M?" and ghost operator Mg, can be found in [23]. The &AC? fluctuations 
behave as 4D scalars of mass m,, in the adjoint representation of the SU(N) group, and 
therefore Ms is the same as the operator expression for a 4D massless scalar Mo given 
in [23]. 

The existence of 4N zero-frequency modes for 6AQ. corresponding to those of a 4D 
instanton (four associated with the instanton location (rg), one for its size (p) and the 
rest for the orientation in group space), tells us that there are 4N eigenstates satisfying 
Mi 5A = 0. Therefore, since M^" is the same for the zero mode as well as for the 
n-th KK mode, we have that for each KK mode there are 4N eigenstates satisfying 
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MW SAL” + m25A = m25AM, ie, they have eigenvalues m2. Using a 5D Pauli- 


Villars field of mass M > 1/R to regularize the theory (whose KK squared masses are 
M?--m2), we can integrate the path integral over these 4N modes and obtain the following 
contribution to the partition function: 


oo M? 2\ 2N 
(==) . (33) 
n=1 


The 4N zero-frequency modes of 5A” must be treated as collective coordinates, which 
means including a pre-factor in the integration of the form 


f te f eI aa (Mp)^* , (34) 


where the coefficient C[N] is given by 


Ci e O2N 
(N-1KN-3' 


CIN] = (35) 


and Ci, Cz are order one constants (Ci = 0.466, C5 = 1.679 using Pauli-Villars regular- 
ization [27]). 

In addition to the zero-frequency modes, there are also positive frequency modes for 
óAQ and 5A” and their corresponding ghosts. In 4D this was calculated in [23], where 
it was shown that the massless gauge bosons and ghosts combine to give a contribution 
equivalent to two real scalars in the adjoint of SU(N). The contribution is approximately 


e73 (Mp) | (36) 


For the nth KK mode we expect a similar contribution where both the transverse part of 
the gauge bosons and ghosts combine to give the contribution of two real scalars of mass 
m2. In addition this must also be combined with the contribution of the longitudinal 
component, 5A” to provide three real scalars of mass m2. Due to their masses, the 
contribution deviates from the massless result (36) and the massive contribution was nu- 
merically calculated in Ref. [31]. We have used this latter result to obtain the contribution 


of positive frequency modes of all nth-KK modes: 

PES oo N 
-3N X4 ( ds In ( He ) + nima) Ts (mao) m2 T NE(R/p)R 
€ ( ( n ) ) = I] M24 m2 € P (37) 


n=1 


where ÎS is defined in [31] and we have used the quite accurate interpolating function 
given in Eq. (6.2) of [31]. The function €(R/p) defined from (37) is shown in Fig 1. Notice 
that for R >> p the function tends to a constant value €(R/p) ~ 0.35, and therefore the 


exponent in (37) has a power-law enhancement « R/p. 
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Finally, combining the contributions (33), (34), (36) and (37) gives the SU(N) result: 


2N 
Ju (oUm +m, 3i NE d CN] nj (= J G as EN(S ln(Mp) HU) . (38) 


n=1 


To understand the physical implication of this contribution, it is convenient to write the 
bare coupling a, appearing in the exponent of (38) as a function of a more physical gauge 
coupling. To do so, we first calculate the one-loop self-energy of the massless mode AQ 
regularized by the 5D Pauli-Villars field. This is given by 


1 1 M? Z M?4+m? 
2X v. n 2 


where bọ = 11N/3, bgg = 7N/2. The value of A(q?) is independent of M and m,n, and 
for q? < 1/ R? is very small, A(q?) < 0.01, and can be neglected. We can then define the 
renormalized one-loop gauge coupling of the massless mode AQ at the compactification 
scale as 

1 240m? 


1 LM 
= 4nll(g?— R?) ~ — — — ac In(MR)? + bc V n — 


a(1/R) DHT ni (40) 


n=1 


Substituting (40) into (38), we obtain the SU(N) result 


2N 
[an [ Sow (=) e es aam NEGRO) 5 —boln 5 (41) 
p Qs 


where we see that the Pauli-Villars mass regulator M has disappeared from the exponent. 
The result for N = 3 is given in (8). There still remains the bare coupling a, in the pre- 
factor that when written in terms of a,(1/R) will depend on the regulator mass M (as 
can be seen from (40)). To eliminate this regulator dependence one should go beyond the 
one-loop level calculation. As a good estimate for p S R, one can replace a, by a,(1/R). 


A.1 Fermion contributions 


It is straightforward to incorporate the effect of fermions. For the instanton to contribute 
to the vacuum energy and the axion potential, fermion zero modes need to be soaked up. 
Usually this is achieved by lifting zero eigenvalues with insertions of the Higgs vacuum 
expectation value. In the single Higgs doublet models considered here the instanton 
contribution can be obtained by soaking up sets of four fermion zero modes with the 
Higgs propagators (see Figure 3). As discussed in the main text, this leads to a pre-factor 
yr/ (41) for each fermion, and (41) is now modified by having by = 11N/3 — 2N,/3 and 
NE(R/p) > (N — 4N;7/3)E(R/p), where Ny is the number of 5D bulk fermions in the 
fundamental representation. In the case of just boundary fermions, bo still changes but 
NE(R/p) remains unchanged. 


16 


B 4D Deconstruction 


In this Appendix we present the calculation of the small instanton contributions using the 
AD deconstruction method in [22]. In the notation of [22] the 5D instanton solution (5) 
corresponds to a multi-instanton configuration with winding numbers (1, 1,..., 1). Using 
this solution, the effective action Ser is given by the SU(N) generalization of equation 
(27) in [22], namely? 
27 R R 

Seg (FC > p > 1/45) = @.(1/R) Se + &4N In "i (42) 
where As is the 5D cutoff. In the step approximation, the numerical factor £y = é& = 1 
for a pure 5D gauge theory compactified on a circle of radius R. The coefficient of the In 
term, 4N is the number of bosonic instanton zero modes. 

In orbifold compactifications AQ is projected out and there are half as many KK 
states, therefore the expressions of [22] must be modified. We find £y = 3 and £j = £2. The 
step approximation can be improved by including threshold corrections which can further 
modify £o; (an example with a bulk scalar in the fundamental representation shown in 
Section 5 of [22] reveals an approximately 3096 change). The expression (42) assumes that 
1/p is sufficiently below the top of the KK tower, and also that R/p > O(10) so there is 
a large number of KK modes lighter than 1/p that feel the instanton. Substituting (42) 
into the partition function gives the leading behaviour 


2m | R R 
c7 — er HON EGAN E (a3) 


This expression reveals that there is an exponential enhancement of the instanton density 
for small instantons of size p « R (assuming £o > 0) due to the power-law term in (42). 

It is instructive to compare this effect with the 4D moose models of [24, 25]. In that 
model the instanton density at each site is suppressed by a factor e-?7/e: = e727/(Nas) 
since 1/o, = 1/o4 + 1/ag + --- ~ N/a, assuming approximately equal site gauge cou- 
plings and N sites. Clearly as N — oo the large exponential suppression is mitigated. 
Alternatively this is equivalent to the freedom in choosing a large site gauge coupling 
Qi. Instead in the extra dimension a single axion receives contributions from the N sites 
which sum to give the factor e-?7/^s, There is no factor of N in the denominator of the 
exponent as found in the 4D moose model, and instead a sum over KK modes gives the 
power-law enhancement in (42). 

To obtain the full instanton contribution to the partition function, the instanton den- 
sity must be integrated over the instanton size p. The integration over the instanton 
size is divergent but is regulated by the finite size of the extra dimension. Assuming an 


?Note the coefficient of the log term in Eq.(27) of [22] should be 8 corresponding to the number of 
bosonic zero modes for SU (2). 
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orbifold compactification (€ = L6 = ax) we obtain 


R 6 
(p C3 On amie (bo4 inf 
s CB] e 
1/As P as (1/p) 


2 6 2 -zm tR 1 
e C[3] | —— : (44) 
oa,(1/R)) 3 (AsR)%-9/4 RA 


where C[3] ~ 1.5 x 10^? using (35), bo = 11, and Azp is defined in (17). This expression 
is consistent with the 5D calculation (10). If the step approximation is improved then the 
exponent in e3^58 is reduced by approximately 3096 to become ~ es”. 

Note that the running of the gauge coupling with instanton size, a,(1/p) has only 


been crudely approximated in (44). The actual running coupling is given by: 


K(p) 


2m 2m R 
——M = ——--cb5n—-b In pm;, , 
as(1/p) a,Q/R) p >, 
2r R 1 R 
| Eds bg | os, 4 
ao ^ OK (^ 2 Kx) — M5) 


where K(p) is the number of KK levels lighter than 1/p, and brg = iN is the KK 
contribution to the 6-function arising from a massive gauge boson [32]. In the second 
line of (45) we have used the fact that in flat space m, = n/R and pa ln pit, = 
== — iln £. Using (45) clearly makes the integrand in (44) larger for 1/A; € p < R and 
therefore the analytic expression in (44) is a conservative lower limit. Thus to obtain a 
sizeable contribution to the integral in (44), the exponential suppression e ast) must 
be overcome by having a large number of KK states, A5 R. 


B.1 Fermion contributions 


Next we consider fermions propagating in the 5th dimension compactified on an orbifold. 
For each chiral SM field we need to introduce a Dirac fermion in the 5D bulk. Thus 
assuming Ny quark flavors there are 2N; Dirac fermions at each KK level. Using the 
results in [22] the effective action for SU(N) becomes: 


Satis d LIU 2 (zo - : ) x (5 n QN z 2) n7 . (46) 
where bg = 7 is the QCD contribution from the gauge boson and fermion zero modes. 
Compared to the pure YM case given in (42) the enhancement in the exponential factor 
e-?«t from the power-law R/p term is now reduced as the number of flavors increases. In 
fact for N = 3 there is only an enhancement for IN; < 2. 

The fermion zero modes again lead to a suppression in the instanton vacuum diagrams 
due to Yukawa couplings and Higgs loops (assuming the Higgs is confined to the bound- 
ary). Using (44) and the suppression factor in (22) the axion mass ratio in the case of 
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bulk fermions then becomes: 


3 (am t AR 
Ma 2&,C [3] 2n (m,-c m4) 1 e 3Veam 2*3 575 id 
TQCD — 7 a,(1/R) JUN Thine (Aeneas C 


where (23) has been used and Ny in this expression refers to the number of bulk fermion 
flavors in the fundamental representation. In particular when Ny = 0 there are only 
boundary fermions (with bọ = 7) and we are consistent with the relation (24). Recall that 
for the 4D deconstruction we have used the step approximation, and corrections to this 
method will reduce the pure YM exponent from 3/2 to ~ 1. 
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We show that axions interacting with abelian gauge fields obtain a potential from loops of mag- 
netic monopoles. This is a consequence of the Witten effect: the axion field causes the monopoles 
to acquire an electric charge and alters their energy spectrum. The axion potential can also be 
understood as a type of instanton effect due to a Euclidean monopole worldline winding around 
its dyon collective coordinate. We calculate this effect, which has features in common with both 
nonabelian instantons and Euclidean brane instantons. To provide consistency checks, we argue 
that this axion potential vanishes in the presence of a massless charged fermion and that it is ro- 
bust against the presence of higher-derivative corrections in the effective Lagrangian. Finally, as a 
first step toward connecting with particle phenomenology and cosmology, we discuss the regime in 
which this potential is important in determining the dark matter relic abundance in a hidden sector 
containing an abelian gauge group, monopoles, and axions. 


I. INTRODUCTION 


Axions are naturally light scalar bosons that are of 
great interest in solving the strong CP problem [1-4], as 
dark matter candidates [5-7], and for many other appli- 
cations. It is well known that instanton effects can gen- 
erate a potential for an axion 0 [3, 4] when it is coupled 
to a nonabelian gauge field via the topological coupling 
Otr(F ^ F). Even in the absence of axion interactions 
with gauge fields, it is known that Euclidean branes can 
give rise to axion potentials [8-12]. Here, we argue that 
axions coupled to abelian gauge fields through a OF A F 
coupling acquire a potential through an instanton effect 
whenever there are monopoles magnetically charged un- 
der F, due to the Witten effect [13]. Like nonabelian 
instantons, these effects are associated with 4d gauge the- 
ory dynamics. Like Euclidean branes, they occur within 
a well-behaved semiclassical expansion free of infrared 
divergences. In fact, we expect that our instantons are 
continuously connected to, or a limiting case of, known 
instanton effects in specific UV completions [14]. The 
virtue of our approach is that, by working from the bot- 
tom up, we deduce that such effects must exist [15] even 
when we do not know the UV theory.! 

'The Completeness Hypothesis postulates that any UV- 
complete theory of an interacting U(1) gauge field (which 
has quantized charge) contains magnetic monopoles [17], 
which break a would-be 1-form global symmetry [18]. 
This is, in particular, believed to be true of all theories 
of quantum gravity [19-22]. Assuming the validity of the 
Completeness Hypothesis, the instanton effect that we 
discuss will give rise to an effective potential for any axion 
interacting with photons. This is of great phenomenolog- 


1 The existence of the instantons we discuss here has been noted 
previously by Jake McNamara [16] and communicated to MR in 
the course of writing [14], although neither considered computing 
an axion potential from them at the time. 


ical interest, since the OF ^ F interaction is the primary 
target of experimental searches for axions [23-26]. 


We consider an effective theory of a periodic axion field 
0 = 6+ 2r coupled to a gauge field A normalized such 
that the allowed Wilson lines P[exp(iq$.A)] have integer 
charge q € Z: 


bo 1 ko 


The axion-gauge field coupling is of Chern-Simons type, 
with quantized coefficient k € Z. Through the Witten 
effect, a magnetic monopole in the presence of a nonzero 
background 0 acquires an electric charge —k0/(27). A 
consistent description of this effect requires that the ef- 
fective theory on the magnetic monopole worldvolume 
contains, in addition to the usual translational zero 
modes x, a collective coordinate interacting with the 
field 0. This takes the form of a compact scalar boson 
c & g + 2m, with an action that (expanding around a 
monopole worldline extended in time) contains [27]: 


1 0 
S -f [gledae Ardao + 2, 04€ ; (2) 


where the gauge-covariant derivative dac = do + kA 
respects a shift of c under A gauge transformations. The 
mode c behaves as a quantum particle on a circle (see, 
e.g., App. D.1 of [28]). Its energy eigenstates, labeled 
by integers n € Z, correspond to dyonic states of the 
monopole with electric charge k (n — 6/27) and energy 


1 8M? 


There is a monodromy n — n + 1 when 0 — 0 + 27 that 
ensures the spectrum of the theory is periodic. 


We can estimate 1, by comparing (3) to the energy of 
the classical field configuration outside a monopole in an 
axion background, following [29], from which we obtain: 


An 
eu 
where re = 7/(e?m,) is the classical radius of the mag- 
netic monopole (of mass my) and ro = ke/(8m?f) is 
the length scale over which the axion field is screened 
near the monopole core. In the special case of critical 
't Hooft-Polyakov monopoles [30, 31], we begin with an 
SU(2) gauge theory with coupling g. Matching to (1) 
gives e — g/2 and k — 2, while matching to (2) (when 
re > ro) gives lo = my/m2, where my = 47v/g and 
Mw = gv is the W boson mass. (We have chosen the 
order-one coefficient in (4) to be accurate for this case, 
but it will differ in general theories.) 

Because the dyon energy spectrum (3) is 6-dependent, 
we can integrate out the dyons and obtain an effective 
potential for 0. This can be understood either as a sum 
of Coleman-Weinberg-type potentials [32] from each dyon 
mode n, or as a sum over loops with nontrivial winding 
of o around the loop. These two calculations are related 
by Poisson resummation, as we explain below. Although 
there is prior work on the 0 potential generated by a gas 
of (non-virtual) monopoles and antimonopoles (see [29, 
33-35] and follow-ups), the effect of monopole loops on 
the vacuum 0 potential is, as far as we know, absent from 
the prior literature. 


lo ~ r, = max(re, ro), (4) 


II. MONOPOLE LOOPS 


We would like to compute the vacuum energy in 
the presence of "fundamental" magnetic monopoles. 
Schematically, the vacuum energy should be derived by 
computing a Euclidean path integral of the form 


z(0- M 


worldlines 


D(fields) e Selfields, worldlines, 6] : (5) 


and taking the limit of infinite spacetime volume, 
ice uL 
Ver (0) = — lim 5 log Z(8). (6) 


The worldline formalism has previously been applied to 
other physical processes involving monopoles, e.g., to pair 
production in magnetic fields [36]. 

In the limit where interactions between the configu- 
rations are small, we expect the partition function to 
be dominated by disconnected vacuum paths character- 
ized by the transition amplitude Zg:(0), the Feynman- 
weighted sum over all paths that are topologically a circle 
S!. These contributions exponentiate: 

I 
2(0) = 9 (2s1)" = exp(Zsi(8)). ^ (0 


n=0 


Hence Veg (0) = -$ Zs:(0); we work in the first-quantized 
picture to compute the amplitude Za:(0) [37]. We sum 


over all trajectories that return to the same configura- 
tion. This includes an integral over the invariant length 
(Schwinger proper time) 7, weighted with a 1/27 to ac- 
count for overcounting trajectories related by transla- 
tions and reflections. So, 


?? dr 
Zgi =} z; Z080), (8) 
0 T. 


with Z(r,0) the sum over transition amplitudes at fixed 
0 of all trajectories with invariant length T. 

There are two ways we can compute Zgi. For a free 
particle of mass m, the gauge fixed transition amplitude 
for a trajectory of length 7 from point x to point x” is 


(a |a) = snr) exo( 1 


(2 — x")? er) (9) 
2T 
After integrating over all trajectories that begin and end 
at the same point and canceling off the a factor of the 
spacetime volume from the measure with the factor in 
the definition of the effective potential, we obtain 


Vie = lx 1 m?r (10) 
Sj, r Anr PV 3 J^ 


We will sum over all dyon modes, labeled by n € Z. 
To simplify the computation, we assume that the dyon 
mass spectrum takes the form 


ü 2 My 
pe ; Man = pu . (11) 


m = m +m (n 
This agrees with (3) to order 1/l;, and in certain cases 
is an exact consequence of a BPS condition. In general, 
there may be power corrections in (Mulo) +. Summing 
over the tower of states, we obtain the effective potential 
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(12) 
Periodicity in 0, arising from the sum over n, is manifest 
after Poisson resummation: 
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The effective potential then becomes 
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After integrating, the result is 


Vor (0) = > Bags 27lmu/ma cos(f9) x 
9m 3mÀà 
14 i s 5 
— 2n/my mo (15) 


where we have ignored the irrelevant constant from the 
divergent / — 0 integral. 


We can think of the integer £ as the number of times 
the coordinate c winds around itself for a particular con- 
figuration, and so we expect that we can interpret the ef- 
fective potential (14) in terms of the monopole wordline 
action. Indeed, if we consider the relativistic completion 
of (2) with the dyon collective coordinate c treated as an- 
other (compact) spatial direction in which the monopole 


propagates, analogous to the DBI action: 


dz, dz” I, (dac V? 0 
Su = dA = l 57440, 
" =m f | dA S UE Es AN 
then we can compute the transition amplitude for a tra- 
jectory of length 7 from point (x,c) to point (x', o^), 


Again, we integrate over all trajectories that begin and 
end at the same point, this time getting a contribution 
from the sum over windings o’ — o = 2z4£, which nicely 
reproduces (15). 

We can understand the exponential factor in (15) via a 
saddle point approximation for each £, corresponding to a 
classical Euclidean instanton solution that winds £ times 
in the o coordinate while remaining at constant z". The 
saddle is at Schwinger proper time 7 = 27¢/(myma). 
The instanton action, which controls the convergence of 
the Fourier expansion (15), is 


2mm, An? 
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S= (18) 
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Remarkably, for the critical ’t Hooft-Polyakov monopole, 
the instanton action is S = 87?/g?, precisely that of the 
classical BPST instanton in Yang-Mills theory [38, 39]! 


A. Light and Massless Fermions 


As is familiar from standard instanton physics, the 
presence of light, charged fermions can dramatically alter 
a theory's 0-dependence. In particular, any dependence 
on 0 should vanish as we take any charged fermion’s mass 
to zero and thus restore a chiral symmetry. 

Such light fermions similarly affect dyonic physics. 
In the presence of a fermion of mass m < my, the 
dyon's electric charge will no longer be localized to its 
core [40, 41] but will rather be dispersed in the fermionic 
vacuum on a length scale of order m !. As m — 0, this 
cloud grows to encompass all of space and an observer at 
finite distance would measure vanishing electric charge: 
this cloud screens the charge induced by the Witten ef- 
fect. Furthermore, there exists a collection of fermionic 
excitations about the dyon that must be accounted for 
when computing our monopole loops. 

While a full analysis of this effect—and the inclusion 
of multiple light fermions—is reserved for future work, 
we can easily understand how it impacts the dyon mass 
spectrum on dimensional grounds. Since the fermion di- 
lutes the induced electric charge over a region roughly the 


size of its Compton wavelength, we expect that rą ~ m7! 


in the estimate (4), and so the dyonic mass spacing be- 
comes of order mi ~ mym. Since this spacing vanishes 
as m — 0, so does the 0-dependence of the dyon tower. 


B. Higher-Derivative Corrections and Validity 


Our calculation assumed the dyon mass spectrum pre- 
sented in (11), which we expect to receive corrections 
in effective field theory when monopoles are not BPS. 
We should check that our result is robust against such 
corrections. These corrections can arise from higher 
derivative operators in the bulk effective Lagrangian, like 
(Fu F^"? or (E, EY’, or higher powers of (0,,0+kA,,) 
in the worldline Lagrangian. These are related: the for- 
mer add B^, (E - B)?, B?E? and E^ terms to the energy 
density p. Integrating p outside the monopole core, sim- 
ilarly to the logic that led us to (4), implies that these 
terms modify the dyon mass spectrum. A series of terms 
of the form c3; E77 /A40-U in p generates corrections to 
the mass spectrum in even powers of (n — 6/27): 


oo 0 2j 
m2 - mimi (»- x) ; (19) 


j=1 


where Ao; ~ Ca; [e?k?/ (1672(r,.A)*) | is small when 
j»1 (and A, = 1, by the definition of m2). Terms 
involving powers of Don B and E give subleading shifts 
to the definitions of m2, m3, and the A;. 

Repeating our earlier logic, we can sum the loop cor- 
rections (10) using the mass spectrum (19). Poisson re- 
summation and relabeling n — 6/27 — n then gives 
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'To evaluate the integral over n, we work in a saddle point 
approximation: defining S;(n) to be the function inside 
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FIG. 1. 


Pink regions: axion oscillates with the monopole loop-induced mass dominating over the monopole background- 


induced mass before CMB formation at temperature Trec. Green solid line: the sum of axion abundance (Qah?) and monopole 
abundance (Qy:h?) today is Ou A? = 0.12, above which the abundance overcloses the Universe. Black dashed lines indicate the 
fraction of axion dark matter in the total abundance of axion and monopole today. Below the blue line, the axion's mass is so 
small that it never oscillates. Left panel: monopole yield saturates the Kibble bound. Right panel: monopole yield is from a 
second order phase transition with a critical exponent v = 0.5. We fix mure = r/e, the critical temperature to be Te = 1/re, 


and f = 10? GeV. 


the exponent in (20), we ask that dSe(n..)/dn = 0. Treat- 
ing A4 as a perturbation, we find that 


2ril ant N? 
n=- zina ( z-) +O(A2, A6). — (21) 
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We require a small correction to the subsequent integral 
over 7, dominated by the saddle at 7. = 27¢/(myma). 
In particular, m7. < 1 (for small £), which calls for 
caution: the semiclassical approximation requires suffi- 
ciently small A2;. In particular, (21) implies that a small 
correction to n, at the saddle Tą requires 


a 
where the last inequality can be derived from (4) and 
the surrounding discussion. Along similar lines, we re- 
quire |Az;| < (ma/my)?9-)). Focusing only on power 
counting in e, this requires that |co;| < e?9-). This will 
always hold when the operator E?/ is generated through 
loops of charged particles, as in the Euler-Heisenberg La- 
grangian, where |co;| ~ e7//(1677). In the case with 
T, = Te, this verifies that our semiclassical calculation 
can be performed within the context of a sensible ef- 
fective field theory in which higher derivative operators 
produce controllably small corrections. The case rą = ro 
requires somewhat more care regarding the allowed range 
of the scale A, which we will not delve into here. 
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III. PHENOMENOLOGICAL APPLICATIONS 


There could potentially be many interesting phe- 
nomenological implications of this new monopole contri- 
bution to the axion potential, which calls for future work. 
Here we will only consider a hidden sector with a gauged 
U(1)a symmetry and a gauge coupling e as a minimal 
example to show that this new contribution could play 
an important role in the cosmological evolution of axion- 
like particles. In the hidden sector, there is also a global 
U(1)pq symmetry which is broken spontaneously at the 
scale f and results in a Goldstone boson, the axion a. 

In the presence of monopoles carrying magnetic charge 
under the U(1)a, the axion obtains a mass from both the 
temperature-independent monopole loop, which is the 
new finding of our paper, and the temperature-dependent 
contribution from a monopole background as discussed 
in [29, 33-35]. The monopole background could be gen- 
erated via the Kibble-Zurek mechanism in a phase tran- 
sition happening at a critical temperature Te in the early 
Universe [42, 43]. The monopole yield satisfies the Kib- 
ble lower bound [42] and could be significantly above the 
bound if the phase transition is second order [43, 44]. In 
this model, both axions and monopoles could be com- 
ponents of dark matter. The relevant parameter space 
is shown in Fig. 1, assuming that the visible and dark 
sectors share a common temperature at early times. We 

see that when the gauge coupling e is large, e Z 0.5, the 
monopole loop-induced axion mass would dominate over 
the contribution from the monopole background before 


the CMB formation. In addition, the axion abundance 
is negligible when e € 0.5, but it could take over that 
of the monopoles at larger values e = 0.5. In order not 
to overclose the Universe, the monopoles must not be 
very heavy [45, 46]. Fig. 1 establishes that the new ef- 
fect we discuss can modify the cosmology of axions and 
monopoles; it would be interesting to incorporate it in a 
wider range of models in the future. 


IV. CONCLUSIONS 


In this article, we have presented and computed a 
new contribution to the vacuum axion potential from 
magnetic monopole loops, when the axion is coupled 
to an abelian gauge field. Much more remains to be 
studied, both in developing the formalism and explor- 
ing the phenomenological and cosmological implications. 
We briefly comment on some possible future directions: 
1) We have assumed V.g(0) is dominated by a single 
monopole loop, but there are long-range Coulomb in- 
teractions between the monopoles. Their effect on the 
semiclassical expansion should be explored. 2) We found 
that the action of the monopole-loop instanton in the 
critical 't Hooft-Polyakov case is that of a BPST instan- 
ton, 81?/g?. This may be a harbinger of a stronger state- 
ment: we expect that the monopole-loop instantons can 
be continuously deformed into nonabelian instantons. If 
not, the theory would have an unbroken global (—1)- 
form U(1) symmetry in 4d, and a (d — 5)-form symme- 
try in higher dimensions [14]. Similarly, in cases where 
U(1) gauge fields arise on D(n + 3)-branes wrapped on 
n-cycles in extra dimensions, one obtains axion poten- 
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Abstract 


We consider the U(1) problem within the AdS/CFT framework. 
We explain how the Witten-Veneziano formula for the 7’ mass is re- 
lated to a generalized Green-Schwarz mechanism. The closed string 
mode, that cancels the anomaly of the gauged U(1) axial symmetry, 
is identified with the r/ meson. In a particular set-up of D3-branes on 
a C3/(Z3 & Z3) orbifold singularity, the n’ meson is a twisted-sector 
R-R field. 


1 Introduction 


The U(1) problem was considered as one of the major issues in particle 
physics in the 70's. The dynamical breaking of the Ur(3) x Un(3) chiral sym- 
metry to the diagonal Uy(3) should result in nine Goldstone bosons, whereas 
only a light octet is observed in nature. The 7’ meson that correspond to the 
breaking of the UA(1) symmetry is too heavy to be a Goldstone. 

It was understood that, at the qualitative level, the resolution of the 
puzzle should involve the ABJ anomaly. It was later suggested by 't Hooft 
that the 7’ meson becomes massive because of instantons [1]. A different 
solution, which will be reviewed here, was suggested by Witten [2] and by 
Veneziano [3]. They showed that the 7’ should be massless in the 't Hooft 
large-N limit (the planar theory), since the anomaly is a 1/N effect, and that 
a 1/N mass for the 7’ is generated when the anomaly is taken into account. 
Their analysis resulted in the celebrated “Witten—Veneziano formula” (WV) 
[2, 3]: 

2 AN; d 
TOU f 6r? 

In this note we would like to derive the WV formula (1) from the AdS/CFT 
correspondence [4]. Aspects of chiral symmetry breaking within the 
^AdS/CFT with flavor" approach were discussed recently in [5]'. As we shall 
see the derivation links the WV formula to the Green-Schwarz (GS) mech- 
anism [6]: the closed string that is needed to cancel the U4(1) anomaly will 
be identified with the 7’ meson, see fig. 1. 

The set-up that we will use is D3-branes on orbifold singularities. Al- 
though we will use a specific orbifold, we suggest that this phenomenon is 
common to a generic set-up of D-branes on orbifold singularities. In particu- 
lar it does not require supersymmetry. In fact, it should be even more general: 
in string theory there are no global symmetries. If the set-up involves the 
field-theory U(1) axial symmetry as a symmetry of string theory?, it must be 
local. Local symmetries cannot be anomalous and hence the anomaly must 
be canceled via a generalized GS mechanism. The intermediate (odd parity 
R-R) closed string mode, which is involved in the anomaly cancellation, is 
the 7’ meson. Another remark is that we are taking the standard AdS/CFT 


| d'z (tr FF (2) tr FF(0))|w,-o - (1) 


! Note that in those models the 7’ is massless, since the limit Ns/N. — 0 is assumed. 
?The U(1) axial symmetry might be an “accidental symmetry” as well. 
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Figure 1: a. The Green-Schwarz mechanism. b. The string theory diagram: 
the R-R closed string mode is identified with the 7’ meson. 


decoupling limit: a’ — 0, while keeping R?/a’ fixed. The proposed GS 
mechanism “survives” this limit, as the anomaly is a field theory effect. 
While our analysis is general, it would be interesting to make an explicit 
calculation of the 7’ mass in the resolved C?/(Za & Z3) orbifold model (in 
units of the string tension). 
The rest of this note is organized as follows: in section 2 we describe our 
model and in section 3 we show how, within our specific set-up, we can derive 


the WV formula. 


2 D3 branes on C?/(Z3 & Z3) orbifold 
singularity 


In this section we set the general framework. Consider, as in [7], the type 
IIB string with a stack of N D3-branes placed on a C?/T orbifold singularity. 
In particular we will be interested in the C?/(Z3@Zs) orbifold. This model, as 
well as the general case of D3-branes on C?/(Z, & Zi») orbifold singularities, 
were analyzed in [8]. 

The conformal field theory on the D3-branes is a supersymmetric chiral 
U?(N) gauge theory with bi-fundamental matter. Therefore, there are mixed 
anomalies of the form U;(1)SU7 (N) (in fact, some combinations of U(1)'s are 
anomaly-free). 

The resolution of this local anomaly problem in the C?/(Z3& Z3) model is 
via a generalized GS mechanism [9]. The anomalous U(1)’s receive a mass of 
the string scale and therefore they are infinitely massive from the field theory 
point of view. The low-energy gauge group is thus SU?(N) (plus additional 
anomaly-free U(1)'s). In addition the local anomalies are canceled via an 
exchange of massless closed strings, see fig. 1. In the above model the closed 
strings modes are twisted-sector R-R fields. The terms that give mass to 
the U(1)'s and cancel the anomaly are WZ-terms, which are localized on the 
world-volume of the D3-branes: 


nz C ^exp PF. (2) 


Indeed, the result of the detailed analysis of ref.[8] is that there are six 
R-R (and NS-NS) twisted-sector moduli. They live on C € C?, hence they 
are six-dimensional. 


Uy(np)x U(N.)x Un(ny) 
3 chirals o m] 1 
3 chirals 1 o Li 


Table 1: The simplified model: a U(N.) gauge theory with fundamental and 
anti-fundamental matter. 


In the AdS/CFT framework (the near-horizon limit of the configuration), 
we consider type IIB string theory on AdS; x S?/(Za x Z3). The R-R twisted 
sector moduli that cancel that U(1) anomalies live on AdS; x St. 

The above model is satisfactory for most of our purposes; however, in 
order to have a clearer picture, we would still like to make two simplifications. 
We wish to turn off the coupling of all but one local U(N.) group. In the 
string theory set-up, combinations of gauge couplings are controlled by NS- 
NS moduli. By varying the six-dimensional twisted NS-NS moduli, we can 
achieve the limit where the coupling of eight of the nine gauge factors is 
much smaller than the coupling of the ninth. This is clearly seen in the 
T-dual picture [8]. 

The second simplification that we wish to make is to change the rank of 
the symmetry group so that we will have a Ur(ny) x U(N.) x Un(ny) group. 
String theory allows us to do so by adding fractional branes at the orbifold 
singularity. A different way of achieving an arbitrary number of flavors, 
including the situation with no flavors at all, is to start with the T-dual 
picture of D5-branes and NS5-branes and to vary the number of D5-branes 
[8]. Thus we will consider the model in table 1. 

Since the two groups U,(1) and Ug (1) are global (or very weakly coupled), 
the twisted-sector R-R fields will not lift their masses. Only the mass of the 
center of the U(N.), which can be identified with the Ug(1) baryon number, 
is lifted by the mixed anomalies SU?(n;)Ug(1) and SUg(n;)Up(1). Thus the 
model we consider here is a SU(N.) gauge theory with Ny = 3n; fundamental 
fermions (and scalars) and Ny anti-fundamentals fields. Actually, when all 
(but the gauge) interactions are turned off, the global symmetry is enhanced 
to SU(Ny). The GS action, in terms of two R-R fields, denoted by Cr and 
Cn, takes the following form (see ref. [10] for an explicit derivation) 


a! 


1 
g- / Ps (zea - Cn)? + 5(OCr, + Cr — «Ar) 
doy f 
+ f de zb - Catr FF. (3) 
8r? 


Note that the anomaly is proportional to Ny. The action Cr — C1 + ay, 
(Cr > Cg + ag) corresponds to a chiral rotation of the fundamental (anti- 
fundamental) fermions. A somewhat simpler way of writing (3) is in terms 
of the combinations C4 = CL — Cg and Cy = CL + Cg 


S nz L Laca)? + lioc, —trA)?) + DPC err (4) 
a’ \2 2 82? 


The symmetry C4 — C'a +a, which became global once the gauge couplings 
of the Ur(nr), Un(ny) gauge groups were turned off, is the axial symmetry. 
Note the similarity of (4) to the QCD effective action for the n [11] (see also 


/ 


[12]), upon the identification C4 = =n. 


3 A derivation of the Witten—Veneziano 
formula 


We now wish to derive the WV formula (1) from the AdS/CFT correspon- 
dence. In the AdS/CFT framework, color singlets are described by closed 
strings [13]. We suggest that the n’ meson of the C?/(Z3& Z3) model, namely 
the lightest pseudo-scalar meson, is the twisted R-R field C4. The arguments 
in favor of this suggestion are clear: the action Ca — Ca + o is the gener- 
ator of the axial symmetry. Moreover, the field C4 is a pseudo-scalar and 
it couples to the anomaly with a strength N,/N., as we expect from the y’ 
3. Finally, we will show below that the mass of the C4 field admits the WV 
formula (1). 

We start by an evaluation of the two-point function (tr F F(x) tr F F(0)). 
The AdS/CFT prescription for calculating two-point functions (O(x) O(0)) 
in the boundary field theory is to evaluate the propagators of the bulk closed 


?In our set-up, the ratio Ny/N- is kept fixed, while Ne — oo; the 1’ is therefore expected 
to have a non-vanishing mass (in contrast to the set-ups in refs.[5]). 


5 


strings that couple to the boundary operator O(x) [4]. There are two bulk 
fields that couple to tr FF: the twisted sector R-R field C4 (3) and the ten 
dimensional R-R 0-form C (the “axion”). In fact, it is well known that the ax- 
ion describes pseudo-scalar glueballs [14]. Thus, according to the AdS/CFT 
correspondence 


quar FPG)e FP) =P Y) Kolno) Y) Kole), 
(5 


where €,A are the couplings of the ten-dimensional R-R 0-form and the 
twisted sector R-R 0-form to O = tr F F, respectively. K(a,0) is a boundary- 
to-boundary propagator. In the following we will identify the C-modes and 
the C'4-modes with glueballs and flavor singlet mesons respectively. 

The propagator of a massless bulk field in a confining supergravity back- 
ground is the same as that of a free massive field in a flat space [13]. It is 
seen by taking the ansatz K(x;r) = e"*F(r) and by solving the 5-d bulk 
wave equation. Indeed, for a background of the form 


C modes CA modes 


ds? = dr?  f(r)dz;, (6) 
The bulk wave-equation takes the form 
f !(r)8, (f7(r)0,F (r)) —@gF(r)=0. (7) 


The eigenvalues q? are interpreted as the 4-d mass of the color-singlet hadron 
and thus the boundary-to-boundary propagators, in momentum space, take 


the form 
D. 1 


zu mum Ap wd 
q?— Mé 


K = = —— 
C A q? = M2, 


(8) 
Since the theory contains massless quarks 4 
f d*z (tr FF(x) tr FF(0)) — 0. (9) 


(There is no meaning to a theta vacuum, since we can use a chiral rotation to 
set the value of theta to an arbitrary value). Let us proceed as in [2]. In the 
^The gauginos, on the other hand, are assumed to have a small mass. It can easily be 


realized in the AdS/CFT framework by an appropriate perturbation of the super-gravity 
solution. 


version of our theory without matter, namely when N;/N, = 0, the quantity 
J dx (tr F F(x) tr FF(0)) is non-vanishing. We therefore must conclude that 
in eq.(5) the contribution from the mesons cancels that from the glueballs: 


M 


: | ~ 
N far 

fia = 
Neo =-0 M, 


(1672)? E xe? (tr FF(z) tr FF(0)) 


(10) 
In fact, when N/N. — 0 the sum over mesons (C4-modes) is dominated 
by the lightest mode whose mass tends to zero since in this limit the anomaly 
vanishes and C'a becomes a true Goldstone. In this limit C4 does not couple 
to the boundary field theory and (7) is solved by F(r) — const.. 
The second ingredient that is needed to complete our derivation is the 
anomaly equation. From (4) we can read the equation of motion for C4 


1 Ne ccs 
9:9 Ca = ped (11) 


We therefore identify + OKC A with the axial current J^. The coupling of the 
r.h.s. of (11) to Ca is by definition 2NA. Following ref.|2] we argue that 
the coupling of ô, J” to C4 is proportional to Mê, with a proportionality 
parameter fa. The reason is that, in momentum space, the coupling of 
J” should be ~ q” (the momentum is the only vector in the problem) and 
therefore the coupling of q, J” is ~ q? = MZ,. Thus 


2N,À = fa Mê, . (12) 
Inserting (12) in (10) 


2 
y - 4N; 1 


CR (er? 9) 


nz e (tr FF(x)tr FF(0)|v; |, ., 

W794 =0 
we arrive at the WV formula (1), provided that C4 is identified with 7’ and 
its coupling square fa with N,/2. 

Note added: I have been informed by J. Barbon of a related work [15], 
where the WV formula is discussed within the framework of “AdS/CFT with 
flavor". 
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Abstract 


One may argue that QCD solves the strong CP problem by itself, without having to introduce 
new symmetries and particles. To test this idea, a lattice simulation is performed. The problem is 
investigated in the CP? model first. It is found that the model has a first order phase transition 
in 0 from a confining phase at small 0 to a deconfining phase at large 0, and that the critical value 
of 0 decreases towards zero as B is taken to infinity. This suggests that 0 is tuned to zero in the 
continuum limit. Preliminary studies of the SU(2) Yang-Mills theory in four dimensions show a 
phase transition in @ as well, so that it is quite likely that the strong CP problem in QCD is solved 


along the same line. 


In QCD, as well as in other theories that possess 
instantons, the proper vacuum states are superpositions 
of vacua of different winding numbers n: 


e) = Y exp(iðn)|n}, 0 € [0, 27). 


n 


(1) 


These so-called 8 vacua are realized by adding a CP 
violating term to the action, 

Se = S — 10Q, (2) 
where $ is the standard action and Q is the topological 
charge. A priori Ó is a free parameter. Since no CP 
violation has been observed in the strong interactions, 0 
must however be very close to zero. + This constitutes 
the strong CP problem. The present upper bound is 
0 « 10-? [1]. 

A popular picture of the QCD vacuum is that 
of a dual superconductor [2] in which color magnetic 
monopoles condense and color electric charges, Le. 


t In the presence of quarks the 0 angle can be rotated into a phase 
of the quark mass matrix M and vice versa, due to the UA(1) 
anomaly. The effective angle is in this case 0 = 0 + argdet M. 


quarks and gluons, are confined by a dual Meissner 
effect. This picture has been successfully tested in 
lattice simulations [3]. In the @ vacuum these monopoles 
acquire a color electric charge of the magnitude 0/27 
[4]. For 0 Z 0 one would expect that the long-range 
color electric forces are screened by monopoles and that 
confinement is lost. So QCD would only be a viable 
(continuum) theory for 0 = 0. 

What makes a lattice simulation at non-vanishing 
values of 0 very difficult is the fact that the action is 
complex. As a result, standard lattice techniques are not 
immediately applicable. This has led us to investigate 
the problem in a simpler model first. 

A model, which in many respects is similar to QCD, 
is the CP- model in two space-time dimensions. 
The CP^-! model deals with N-component, complex 
scalar fields z4(x) of unit length: Za(£)za(£) = 1, a = 
1,---,N. Out of these fields one constructs composite 
vector fields 


0.001 n o? 
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Figure 1. The free energy F (0) (a), the charge density q(@) (b) and the string tension ø (e, 8) (c) as a function of 
0 on the V = 64? lattice at 8 = 2.7. The solid curve in (a) is the prediction of the large-N expansion to leading 
order. Only the first half of the 0 interval is displayed. In the second half of the interval F(0) = F(2x — 6). 


Then the action [5] can be written [6] 


= g | &«D, Duas (4) 


where D, = 0, + iA,. Thus the model describes a 
set of charged scalar fields interacting minimally with a 
composite gauge field. The topological charge 1s given 


by 


1 1 
Q= ae | ds) = z; | df (5) 
The 8 dependence of the theory is governed by the 
partition function 


Z(0) = $ ^ exp(i0Q) p(Q) = exp(-V F(0)), (6) 
Q 


where p(Q) is the probability of finding a field 
configuration with charge Q, and F(6) is the free energy 
per space-time volume V. In terms of F(0) the average 
topological charge density 1s given by 


tagos -uo (n 


and the string tension of two external particles of charge 
e and —e (in units of the intrinsic charge) turns out to 
be 

o(e,0) = F(8 + 2re)— F(0). (8) 


We have chosen to investigate the CP? model. For 
details of the calculation see ref. [7, 8]. 


If our idea 1s correct, we should find a first. order 
phase transition in @ from a confining phase to a Higgs 
or Coulomb phase. On a finite lattice and at a finite 
value of 8 the phase transition is expected to occur at 
a value 0 = 6,(8, V) > 0, where V now is the lattice 
volume. Only on an infinite lattice and at 9 = oo, Le. 
in the continuum limit, would we expect that 0e = 0. 

A first order phase transition manifests itself in a 
kink in the free energy, as well as in a discontinuity in 
the first derivative of the free energy, i.e. the topological 
charge density. In Fig. 1 I show F(@) on the V = 64? 
lattice at @ = 2.7. For comparison I also show the 
prediction of the large-N expansion to leading order 
[6], F(@) = const. -0?. We see a distinctly marked 
kink at 0 = 0, z 0.57: while F(0) increases roughly 
proportional to 0? up to 0 = 8e, F(0) turns out to be 
constant (within the error bars) for 0 > 0e. In Fig. 1 
I also show the topological charge density on the same 
lattice. According to (6) this can be interpreted as a 
background electric field. We see that q(0) increases 
almost linearly with @ up to 0 = 0., where it jumps to 
zero and then stays zero over the rest of the interval 
(again within the error bars). Thus the phase transition 
is marked by a collapse of the background electric field, 
presumably due to pair production. 

To show that the phase transition is a deconfining 
phase transition, I have plotted the string tension in Fig. 
1, again on the 64? lattice at @ = 2.7, for two different 
charges. To keep the discussion simple, I have restricted 
myself to the case of small fractional charges. This 
matters because the vacuum will change its properties 
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Figure 2. The critical value 0:(8, V) as a function of V ^1 for 
three values of 6. The lines are a linear fit to the data points. 


if it is exposed to a strong external electric field. We see 
that the string tension is zero (within the error bars) for 
0 > 0e. This result is a consequence of the property that 
F(0) is constant for 0 > 0.. The string tension reaches 
its peak at 0 = 0. — 27e. 

In Fig. 2 I show all our results for 0,(8, V) for 
a variety of lattice volumes ranging from V = 28? to 
V = 200? and for three values of @: 8 = 2.5, 8 = 2.7 
and 8 = 2.9. The lattice data display strong finite size 
effects. For a first order phase transition we expect 


6.(8, V) — 6:(8,o0) x V~" (9) 


for fixed 8. When 6,(8,V) is plotted against V ^1, we 
find that for all three values of @ our data fall on a 
straight line, in accordance with the predictions of a first 
order phase transition. This allows us to extrapolate the 
lattice results to the infinite volume. 

We find that the extrapolated values of 0, fall rapidly 
to zero as f is taken to infinity. Our results are 
consistent with a decay like 0, c 1/£, where € is the 
correlation length. (For our values of 8 the correlation 
length is 4.5, 8.8 and 18.5, respectively.) In the strong 
coupling limit the model 1s known to have a first order 
phase transition at 6 = 0, = « [9]. This suggests a phase 
diagram of the form shown in Fig. 3. 

Thus 0 = 0 is the only value at which one can take 
the continuum limit, at least within the confining phase. 
This resolves the strong CP problem. 

As a next step towards solving QCD, we are 
currently investigating the problem in the SU(2) Yang- 
Mills theory in four dimensions [10]. Preliminary results 
indicate a first order phase transition in 0, just as in 
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Figure 3. The phase diagram. The horizontal line at 0 = 7 is 
the strong coupling prediction, and the symbols are the lattice 
data extrapolated to infinite volume. 


the case of the CP? model. Because of lack of space 
I cannot show a figure here. To establish the phase 
diagram firmly, we will have to repeat the calculation 
on lattices of various sizes and for several values of 8 
still, as we did for the CP? model. 
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